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COMPOSITIONS, ORGANISMS AND METHODOLOGIES 
EMPLOYING A NOVEL HUMAN KINASE 

[0001] The present application incorporates by reference U.S. Provisional Application 
Serial No. 60/417,155 filed October 10, 2002 and entitled "Composition, Organisms and 
Methodologies Employing a Novel Human Kinase." 

FIELD OF THE INVENTION 

[0002] The present invention relates to compositions, organisms and methodologies 
employing a novel human protein kinase, NIMA-related human kinase 1 (NRHK1), which 
has sequence homology to the catalytic domain of tyrosine protein kinases and 
serine/threonine protein kinases, and is distantly related to NIMA-related protein kinases. 
This invention can be used for diagnosing, prognosing and treating kinase-related diseases 
and, in particular, diseases associated with aberrant expression of NRHK1. 

BACKGROUND OF THE INVENTION 

[0003] Protein kinases regulate many different cell proliferation, differentiation, and 
signaling processes by adding phosphate groups to proteins. Uncontrolled signaling has 
been implicated in a variety of disease conditions including inflammation, cancer, 
arteriosclerosis, and psoriasis. Reversible protein phosphorylation is the main strategy for 
controlling activities of eukaryotic cells. It is estimated that more than 1,000 of the 10,000 
proteins active in a typical mammalian cell are phosphorylated. The high energy phosphate, 
which drives activation, is generally transferred from adenosine triphosphate molecules 
(ATP) to a particular protein by protein kinases and removed from that protein by protein 
phosphatases. Phosphorylation occurs in response to extracellular signals (hormones, 
neurotransmitters, growth and differentiation factors, etc.), cell cycle checkpoints, and 
environmental or nutritional stresses. The phosphorylation process is roughly analogous to 
turning on a molecular switch. When the switch goes on, the appropriate protein kinase 
activates a metabolic enzyme, regulatory protein, receptor, cytoskeletal protein, ion channel 
or pump, or transcription factor. 

[0004] The kinases comprise the largest known protein group, a superfamily of 
enzymes with widely varied functions and specificities. They are usually named after their 



substrate, their regulatory molecules, or some aspect of a mutant phenotype. With regard to 
substrates, the protein kinases may be roughly divided into two groups: those that 
phosphorylate tyrosine residues (protein tyrosine kinases, PTK) and those that 
phosphorylate serine or threonine residues (serine/threonine kinases, STK). A few protein 
kinases have dual specificity and phosphorylate threonine and tyrosine residues. Almost all 
kinases contain a similar 250-300 amino acid catalytic domain. The primary structure of the 
kinase domains is conserved and can be further subdivided into 1 1 subdomains. The N- 
terminal of the kinase domain, which contains subdomains I-IV, generally folds into a lobe- 
like structure that binds and orients the ATP (or GTP) donor molecule. The C terminal of 
the kinase domain forms a larger lobe, which contains subdomains VI-XI, binds the protein 
substrate and carries out the transfer of the gamma phosphate from ATP to the hydroxyl 
group of a serine, threonine, or tyrosine residue. Subdomain V spans the two lobes. Each of 
the 1 1 subdomains contains specific residues and motifs or patterns of amino acids that are 
characteristic of that subdomain and are highly conserved. 

[0005] The kinases may be categorized into families by the different amino acid 
sequences (generally between 5 and 1 00 residues) located on either side of, or inserted into 
loops pf, the kinase domain. These added amino acid sequences allow the regulation of 
each kinase as it recognizes and interacts with its target protein. 

[0006] The presence of a phosphate moiety modulates protein function in multiple ways. 
A common mechanism involves changes in the catalytic properties (Vmax and Km) of an 
enzyme, leading to its activation or inactivation. 

[0007] A second widely recognized mechanism involves promoting protein-protein 
interactions. An example of this is the tyrosine autophosphorylation of the ligand-activated 
EGF receptor tyrosine kinase. This event triggers the high-affinity binding to the 
phosphotyrosine residue on the receptor's C-terminal intracellular domain to the SH2 motif 
of an adaptor molecule Grb2. Grb2, in turn, binds through its SH3 motif to a second adaptor 
molecule, such as SHC. The formation of this complex activates the signaling events that 
are responsible for the biological effects of EGF. Serine and threonine phosphorylation 
events also have been recently recognized to exert their biological function through protein- 
protein interaction events that are mediated by the high-affinity binding of phosphoserine 
and phosphothreonine to the WW motifs present in a large variety of proteins. 
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[0008] A third important outcome of protein phosphorylation is changes in the subcellular 
localization of the substrate. As an example, nuclear import and export events in a large 
diversity of proteins are regulated by protein phosphorylation. 

[0009] Many kinases are involved in regulatory cascades wherein their substrates may 
include other kinases whose activities are regulated by their phosphorylation state. 
Ultimately the activities of some downstream effectors are modulated by phosphorylation 
resulting from activation of such a pathway. 

SUMMARY OF THE INVENTION 

[0010] The present invention discloses compositions, organisms and methodologies 
employing a novel human protein kinase. The new human protein kinase shares sequence 
homology with the consensus sequence of the catalytic domains of serine/threonine protein 
kinases, pkinases, and tyrosine kinases. The new protein also shows homologies to the 
kinase domain of an NIMA-related kinase. The gene encoding this protein is localized near 
locus 9q34 of human chromosome 9. This new gene is hereinafter referred to as NIMA- 
related human kinase 1 (NRHK1) gene, and its encoded protein(s) is referred to as NRHK1 
or NRHK1 kinase. 

[001 1] The kinase domain in NRHK1 shows 1 00% sequence alignment with the 
consensus sequences of the catalytic domains of serine/threonine protein kinases, 100% 
sequence alignment with the consensus sequence of the pkinase domain, 87.5% sequence 
alignment with the consensus sequences of the catalytic domain of tyrosine kinases, and 
28% sequence alignment with the kinase domain of an NIMA-related protein kinase. The 
utilities of various kinase domains are known in the art. The unique peptide sequences, and 
nucleic acid sequences that encode the peptides, can be used as models for the development 
of human therapeutic targets, aid in the identification of therapeutic proteins, and serve as 
targets for the development of human therapeutic agents that modulate kinase activity in 
cells and tissues that express the kinase. 

[0012] In one aspect, the invention provides isolated polynucleotides comprising a 
nucleotide sequence encoding NRHK1 or a variant of NRHK1. 

[0013] In another aspect, the invention provides isolated polypeptides comprising the 
amino acid sequence of NRHK1 or a variant of NRHK1. 
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[0014] In yet another aspect, the invention provides agents that modulate the expression 
level of the NRHK1 gene or an activity of NRHK1 . 

[0015] The invention also provides methods for (a) detecting polynucleotides comprising 
a nucleotide sequence encoding NRHK1 or a variant of NRHK1 and (b) detecting 
polypeptides comprising an amino acid sequence of NRHK1 or a variant of NRHK1 in a 
biological sample. 

[0016] The invention further provides methods for screening agents that modulate 
expression level of the NRHK1 gene or an activity of NRHK1. 

[0017] The invention further provides cell lines harboring the NRHK1 gene, animals 
transgenic for the NRHK1 gene, and animals with interrupted NRHK1 gene (NRHK1 
knockout animals). These cell lines and animals can be used to study the functions of 
NRHK1. 

[0018] In still another aspect, the invention provides polynucleotides capable of inhibiting 
NRHK1 gene expression by RNA interference. 

[0019] The invention further provide methods of inhibiting NRHK1 gene expression by 
introducing siRNAs or other RNAi sequences into target cells. 

[0020] The preferred embodiments of the inventions are described below in the Detailed 
Description of the Invention. Unless specifically noted, it is intended that the words and 
phrases in the specification and claims be given the ordinary and accustomed meaning to 
those of ordinary skill in the applicable art or arts. If any other meaning is intended, the 
specification will specifically state that a special meaning is being applied to a word or 
phrase. 

[0021] It is further intended that the inventions not be limited only to the specific 
structure, material or methods that are described in the preferred embodiments, but include 
any and all structures, materials or methods that perform the claimed function, along with 
any and all known or later-developed equivalent structures, materials or methods for 
performing the claimed function. 

[0022] Further examples exist throughout the disclosure, and it is not applicant's intention 
to exclude from the scope of his invention the use of structures, materials, or methods that 
are not expressly identified in the specification, but nonetheless are capable of performing a 
claimed function. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0023] The inventions of this application are better understood in conjunction with the 
following drawings, in which: 

[0024] FIGURE 1 compares amino acid residues 28 to 297 of NRHK1 to the catalytic 
domain of a family of Ser/Thr protein kinases. 

[0025] FIGURE 2 shows the sequence alignment between amino acid residues 28 to 296 
of NRHK1 and the protein kinase domain of pkinases. 

[0026] FIGURE 3 shows the sequence alignment between amino acid residues 59 to 294 
of NRHK1 and the catalytic domain of a family of tyrosine kinases. 

[0027] FIGURE 4 illustrates the sequence alignment between amino acid residues 25 to 
285 of NRHK1 and amino acid residues 1 to 246 of an NIMA-related protein kinase. 

[0028] FIGURE 5 shows the hydrophobicity profile of NRHK1 . 
DETAILED DESCRIPTION OF THE INVENTION 

[0029] The following detailed description is presented to enable any person skilled in the 
art to make and use the invention. For purposes of explanation, specific nomenclature is set 
forth to provide a thorough understanding of the present invention. However, it will be 
apparent to one skilled in the art that these specific details are not required to practice the 
invention. Descriptions of specific applications are provided only as representative 
examples. Various modifications to the preferred embodiments will be readily apparent to 
one skilled in the art, and the general principles defined herein may be applied to other 
embodiments and applications without departing from the spirit and scope of the invention. 
The present invention is not intended to be limited to the embodiments shown, but is to be 
accorded the widest possible scope consistent with the principles and features disclosed 
herein. 

[0030] The present invention is based on the sequence information obtained from a newly 
developed genomic prediction pipeline. Briefly, the X-ray crystal structures of the catalytic 
domains of protein kinases were collected and aligned together according to their structural 
identity/similarities. The alignment was converted into a "scoring matrix" which carried the 
structural profile of the kinase catalytic domains. This scoring matrix was then used to 
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search the Celera Human Genome database and pull out sequences that have kinase 
catalytic domains. 

[0031] Based on this analysis, the present invention provides the amino acid sequence of a 
human kinase peptide containing a kinase domain that is highly homologous to the 
consensus sequences of the catalytic domain of serine/threonine protein kinases, cDNA 
sequences and genomic sequences that encode the kinase peptide, and information about the 
closest art known protein/peptide/domain that has structural or sequence homology to the 
kinase of the present invention. 

[0032] The peptide of the present invention may be used for the development of 
commercially important products and services. Various aspects of the invention are 
described in detail in the following subsections. The use of subsections is not meant to limit 
the invention. Each subsection applies to any aspect of the invention. 

Definitions and Terms 

[0033] To facilitate the understanding of the present invention, a number of terms and 
phrases are defined below: 

[0034] As used herein, a polynucleotide or a polypeptide is "isolated" if it is removed 
from its native environment. For instance, a polynucleotide or a polypeptide is isolated 
through a purification process such that the polynucleotide or polypeptide is substantially 
free of cellular material or free of chemical precursors. The polynucleotide/polypeptide of 
the present invention can be purified to homogeneity or other degrees of purity. The level 
of purification will be based on the intended use. As appreciated by one of ordinary skill in 
the art, a polynucleotide/polypeptide can perform its desired function(s) even in the 
presence of considerable amounts of other components or molecules. 
[0035] In some uses, a polynucleotide/polypeptide that is "substantially free of cellular 
material" includes preparations which have less than about 30% (by weight) other 
polynucleotides/polypeptides including contaminating polynucleotides/polypeptides. For 
instance, the preparations can have less than about 20%, less than about 10%, or less than 
about 5% other polynucleotides/polypeptides. If a polynucleotide/polypeptide preparation 
is recombinantly produced, it can be substantially free of culture medium, i.e., culture 
medium components representing less than about 20% by weight of the 
polynucleotide/polypeptide preparation. 
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[0036] The language "substantially free of chemical precursors" includes preparations in 
which the polynucleotide/polypeptide is separated from chemical precursors or other 
chemicals that are involved in the synthesis of the polynucleotide/polypeptide. In one 
embodiment, the language "substantially free of chemical precursors" includes kinase 
preparations having less than about 30% (by weight), less than about 20% (by weight), less 
than about 10% (by weight), or less than about 5% (by weight) chemical precursors or other 
chemicals used in the synthesis. 

[0037] As used in the present invention, a polynucleotide introduced into a cell is an 
isolated polynucleotide. Likewise, a polypeptide expressed from an introduced vector in a 
cell is also an isolated polypeptide. 

[0038] A "polynucleotide" can include any number of nucleotides. For instance, a 
polynucleotide can have at least 10, 20, 25, 30, 40, 50, 100 or more nucleotides. A 
polynucleotide can be DNA or RNA, double-stranded or single-stranded. A polynucleotide 
encodes a polypeptide if the polypeptide is capable of being transcribed and/or translated 
from the polynucleotide. Transcriptional and/or translational regulatory sequences, such as 
promoter and/or enhancer(s), can be added to the polynucleotide before said transcription 
and/or translation occurs. Moreover, if the polynucleotide is singled-stranded, the 
corresponding double-stranded DNA containing the original polynucleotide and its 
complementary sequence can be prepared before said transcription and/or translation. 
[0039] As used herein, "a variant of a polynucleotide" refers to a polynucleotide that 
differs from the original polynucleotide by one or more substitutions, additions, and/or 
deletions. For instance, a variant of a polynucleotide can have 1, 2, 5, 10, 15, 20, 25 or 
more nucleotide substitutions, additions or deletions. Preferably, the modifications) is in- 
frame, i.e., the modified polynucleotide can be transcribed and translated to the original or 
intended stop codon. If the original polynucleotide encodes a polypeptide with a biological 
activity, the polypeptide encoded by a variant of the original polynucleotide variants 
substantially retains such activity. 

[0040] Preferably, the biological activity is reduced/enhanced by less than 50%, or more 
preferably, less than 20%, relative to the original activity. 
[0041] A variant of a polynucleotide can be a polynucleotide that is capable of 
hybridizing to the original polynucleotide, or the complementary sequence thereof, under 
reduced stringent conditions, preferably stringent conditions, or more preferably, highly 



7 



stringent conditions. Examples of conditions of different stringency are listed in Table 1. 
Highly stringent conditions are those that are at least as stringent as conditions A-F; 
stringent conditions are at least as stringent as conditions G-L; and reduced stringency 
conditions are at least as stringent as conditions M-R. As used in Table 1 , hybridization is 
carried out under a given hybridization condition for about 2 hours, followed by two 15- 
minute washes under the corresponding washing condition(s). 
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Table 1. St rin gency Co nditions 



Stringency 
Condition 


roiy- 
nucleotide . 

Pfvhrirl 
nyuriu 


Hybrid 
^ength (bp) 1 


Hybridization 
Temperature and Buffer" 


wasn lemp. 
and Buffer" 


A 


DNA:DNA 


>50 


65°C; lxSSC -or- 
4?°C; IxSSC. 50% formamide 


CO U.jaooL/ 


T} 

D 




<50 


T R *; lxSSC 


T R *; IxSSC 


c 


DNA:RNA 


>50 


67°C; lxSSC-or- 
4S°C: IxSSC. 50% formamide 


0/ L, U.jajoV^ 


D 


TYM A -PAT A 


<50 


T n *; lxSSC 


Tn*; lxSSC 


E 


RNA:RNA 


>50 


70°C; lxSSC -or- 
S0°C: IxSSC. 50% formamide 




r 




<50 


T F *; IxSSC 


T f *; lxSSC 


G 


DNA:DNA 


>50 


65°C; 4xSSC -or- 
42°C; 4xSSC, 50% formamide 


CO IXooL 


TT 


TYNJ A -TYNT A 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA:RNA 


>50 


67°C; 4xSSC -or- 
45°C; 4xSSC, 50% formamide 




T 
J 


TYM A -PAT A 


<50 


Tj*; 4xSSC 


Tj*; 4xSSC 


K 


RNA:RNA 


>50 


70°C; 4xSSC -or- 
S0°C: 4xSSC. 50% formamide 


Of C ? IXooL 


T 

L 




<50 


TV*: 2xSSC 


T T *; 2xSSC 


M 


DNA:UNA 




50°C; 4xSSC -or- 
40°C: 6xSSC. 50% formamide 


50°C; 2xSSC 


N 


DNA:DNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


0 


DNA:RNA 


>50 


55°C; 4xSSC -or- 
4?°C: 6xSSC. 50% formamide 


55°C; 2xSSC 


P 


DNA:RNA 


<50 


T P *; 6xSSC 


T P *; 6xSSC 


Q 


RNA-.RNA 


>50 


60°C; 4xSSC -or- 
45°C: 6xSSC. 50% formamide 


60°C; 2xSSC 


R 


RNA:RNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



'• The hybrid length is that anticipated for the hybridized region(s) of the hybridizing 
polynucleotides. When hybridizing a polynucleotide to a target polynucleotide of unknown 
sequence, the hybrid length is assumed to be that of the hybridizing polynucleo ide. When 
polynucleotides of known sequence are hybridized, the hybrid length can be determined by 
aligning the sequences of the polynucleotides and identifying the region or regions of 
optimal sequence complementarity. 

H - SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 P0 4 , and 1.25mM EDTA, pH 7.4) can be 
substituted for SSC (lxSSC is 0.1 5M NaCl and 15mM sodium citrate) in the hybridization 
and wash buffers. 

T * . Xr *. The hybridization temperature for hybrids anticipated to be less than 50 'base 
pairs in length should be 5-1 0°C less than the melting temperature (T m ) of the hybrid, where 
T m is determined according to the following equations. For hybrids less than 18 base pairs 
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in leneth T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 
49 base pairs in length, T m (°C) = 81.5 + 16.6(log 10 Na + ) + 0.41(%G + C) - (600/N), where N 
is the number of bases in the hybrid, and Na + is the concentration of sodium ions in the 
hybridization buffer (Na + for lxSSC = 0.1 65M). 

[00421 It will be appreciated by those of ordinary skill in the art that, as a result of the 
degeneracy of the genetic code, there are many polynucleotide variants that encode the 
same polypeptide. Some of these polynucleotide variants bear minimal sequence homology 
to the original polynucleotide. Nonetheless, polynucleotides that vary due to differences in 
codon usage are specifically contemplated by the present invention. 

[0043] As used herein, a "polypeptide" can include any number of amino acid residues. 
For instance, a polypeptide can have at least 5, 10, 15, 20, 30, 40, 50 or more amino acid 
residues. 

[0044] As used herein, a "variant of a polypeptide" is a polypeptide that differs from 
the original polypeptide by one or more substitutions, deletions, and/or insertions. 
Preferably, these modifications do not substantially change (e.g., reduce or enhance) the 
original biological function of the polypeptide. For instance, a variant can reduce or 
enhance or maintain the biological activities of the original polypeptide. Preferably, the 
biological activities of the variant is reduced or enhanced by less than 50%, or more 
preferably, less than 20%, relative to the original polypeptide. 

[0045] Similarly, the ability of a variant to react with antigen-specific antisera can be 
enhanced or reduced by less than 50%, preferably less than 20%, relative to the original 
polypeptide. These variants can be prepared and evaluated by modifying the original 
polypeptide sequence and then determining the reactivity of the modified polypeptide with 
the antigen-specific antibodies or antisera. 

[0046] Preferably, a variant polypeptide contains one or more conservative 
substitutions. A "conservative substitution" is one in which an amino acid is substituted for 
another amino acid which has similar properties, such that one skilled in the art would 
expect that the secondary structure and hydropathic nature of the substituted polypeptide 
will not be substantially changed. Conservative amino acid substitutions can be made on 
the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or 
the amphipathic nature of the residues. Negatively charged amino acids include aspartic 
acid and glutamic acid, and positively charged amino acids include lysine and arginine. 
Amino acids having uncharged polar head groups and similar hydrophilicity values include 
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leucine, isoleucine and valine, or glycine and alanine, or asparagine and glutamine, or 
serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that can produce 
conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, 
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A polypeptide 
variant can also contain nonconservative changes. 

[0047] Polypeptide variants can be prepared by the deletion and/or addition of amino 
acids that have minimal influence on the biological activity, immunogenicity, secondary 
structure and/or hydropathic nature of the polypeptide. Variants can be prepared by, for 
instance, substituting, modifying, deleting or adding one or more amino acids residues in 
the original sequence. Polypeptide variants preferably exhibit at least about 70%, more 
preferably at least about 90%, and most preferably at least about 95% sequence homology 
to the original polypeptide. 

[0048] Polypeptide variants include polypeptides that are modified from the original 
polypeptides either by a natural process, such as a post-translational modification, or by a 
chemical modification. These modifications are well known in the art. Modifications can 
occur anywhere in the polypeptide, including the backbone, the amino acid side-chains and 
the amino or carboxyl termini. It will be appreciated that the same type of modification can 
be present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide can contain many types of modifications. Polypeptides may be branched, 
for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic polypeptides can result from natural post- 
translational processes or be made through synthetic methods. Suitable modifications for 
this invention include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and 
ubiquitination. 
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[0049] As used herein, the term "modulation" includes up-regulation, induction, 
stimulation, potentiation, inhibition, down-regulation or suppression, or relief of inhibition. 

[0050] A nucleotide sequence is "operably linked" to another nucleotide sequence if 
the two sequences are placed into a functional relationship. For example, a coding sequence 
is operably linked to a 5' regulatory sequence if the 5' regulatory sequence can initiate 
transcription of the coding sequence in an in vitro transcription/translation system or in a 
host cell. "Operably linked" does not require that the DNA sequences being linked are 
contiguous to each other. Intervening sequences may exist between two operably linked 
sequences. 

[0051] As used herein, a "disease-free" human refers to a human who does not have 
NRHK1 -related diseases. Disease-free cells, tissues or samples refer to cells, tissues or 
samples obtained from such disease free human(s). 

[0052] A polynucleotide is "capable of hybridizing" to a gene if the polynucleotide can 
hybridize to at least one of the following sequences: (1) the sequence of an RNA transcript 
of the gene, (2) the complementary sequence of an RNA transcript of the gene, (3) the 
cDNA sequence of an RNA transcript of the gene, (4) the complementary sequence of the 
cDNA sequence of an RNA transcript of the gene, (5) a genomic sequence of the gene, and 
(6) the complementary sequence of a genomic sequence of the gene. 

[0053] As used herein, sequence "identity" in an alignment can be determined by the 
standard protein-protein BLAST program (blastp), the standard nucleotide-nucleotide 
BLAST program (blastn) or the BLAST2 Sequence program. Suitable BLAST programs 
can be found at the web site maintained by the National Center of Biotechnology 
Information (NCBI), National Library of Medicine, Washington, DC, USA. 

Human NRHK1 Gene and NRHK1 Kinase 

[0054] The present invention identifies a new human gene (NRHK1 gene) that encodes 
a protein containing sequences homologous to the consensus sequences of the catalytic 
domain of serine/threonine kinases, pkinases, and tyrosine kinases. The new protein is also 
distantly related to an NIMA-related protein kinase. The nucleotide sequence encoding 
NRHK1 and the amino acid sequence of NRHK1 are depicted in SEQ ID NOS:l and 2, 
respectively. The NRHK1 gene is localized in locus 9q34 of human chromosome 9. 
Specifically, the NRHK1 gene is located between genes LOCI 57890 and LOC57109, and 
overlaps with gene LOCI 69436. 
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[0055] Human chromosome locus 9q34 and the neighboring regions have been 
associated with multiple diseases, including but are not limited to, colon cancer, prostate 
cancer, head and neck squamous cell carcinoma, lung adenocarcinoma, bladder cancer, 
renal cell carcinoma, T cell non-Hodgkin's lymphoma, insulinoma, childhood adrenocortical 
tumors, chronic myeloid leukemia, tuberous sclerosis, the nail-patella syndrome, lattice 
dystrophy Type II, primary torsion dystonia, congenital generalized lipodystrophy, and 
juvenile amyotrophic lateral sclerosis. 

[0056] Human NRHK1 gene has 21 exons. The exons are mapped to the nucleotide 
sequences of human chromosome 9 in Celera genomic database (SEQ ID NO:3). Exons 1- 
29, 31 and 32 are also mapped to nucleotides 1 10001 to 137201 in human chromosome 9 of 
the Entrez Human Genome Sequence Database maintained by NCBI. Table 2 lists the 
location of each of these 21 exons in the genomic sequence SEQ ID NO:3. Table 2 also 
illustrates the corresponding location of each exon in the NRHK1 -coding sequence SEQ ID 
NO:l. 

Table 2. Exons in Human NRHK1 Gene 



Exon Numbers 


Corresponding Sequence in SEQ 
ID NO:3 


Corresponding Sequence in 
SEQ ID NO:l 


1 


1-87 


1 -87 


2 


2563 - 2469 


88-174 


3 


8734 - 8778 


175-219 


4 


10495 - 10569 


220 - 294 


5 


12325 - 12426 


295 - 396 


6 


14404 - 14474 


397 - 467 


7 


15547-15662 


468 - 583 


8 


18710- 18828 


584 - 702 


9 


20019-20182 


703 - 866 


10 


21583 -21841 


867- 1125 


11 


23864-23951 


1126-1213 


12 


24833 - 24949 


1214-1330 


13 


25277-25418 


1331 - 1472 


14 


26158-26298 


1473 - 1613 


15 


26640 - 26725 


1614- 1699 



13 



16 


27298 - 27432 


1700- 1834 


17 


27841 - 27930 


1835 - 1924 


18 


28115 - 28243 


1925 -2053 


19 


28335 - 28463 


2054-2182 


20 


29204 - 29344 


2183 -2323 


21 


29667 - 29836 


2324 - 2493 



[0057] A conserved domain search using RPS-BLAST program (RPS-BLAST 2.2.3 
[April 24, 2002], available at the BLAST web site maintained by NCBI), showed that 
NRHK1 contained sequences homologous to the consensus sequences of several protein 
kinase domains. 

[0058] Specifically, the amino acid residues 28 to 297 of NRHK1 are highly 
homologous to a catalytic domain of a family of Ser/Thr protein kinases (Entrez accession 
number: smart00220). This kinase family includes C-Jun N-terminal kinase (JNK3), 
abelson tyrosine kinase, a calmodulin-binding, vesicle-associated, protein kinase-like 
protein (1G5), serine/threonine-protein kinase prp4, Cdc2/Cdc28 subfamily of Ser/Thr 
protein kinases in C. elegans, and ribosomal S6 kinase of C elegans. FIGURE 1 shows that 
the two sequences share 100.0% alignment with a score of 130 bits and an E value of 2 x 
10~ 31 . As used in other figures of this invention, "Query" denotes to the sequence of 
NRHK1, and "Sbjct" refers to the sequence being compared to the NRHK1 sequence. 

[0059] FIGURE 2 shows that the amino acid residues 28 to 296 of NRHK1 also 
aligned 100.0% with the protein kinase domain of pkinase (Entrez accession number: 
pfam00069). The alignment has a score of 1 19 bits, and an E value of 3 x 10" 28 . This 
pkinase family includes protein kinase Ck2, wee 1 -like protein kinase (WEElhu), and 
tyrosine-protein kinase RYK. 

[0060] FIGURE 3 shows the sequence alignment between amino acid residues 59 to 
294 of NRHK1 and the N-terminal of a family of tyrosine kinases (Entrez accession 
number: smart00219). This family includes the tyrosine kinase domain of fibroblast growth 
factor receptor 1, tyrosine-protein kinase (KIN15/KIN16 subfamily), and a Drosophila 
receptor protein-tyrosine kinase family member (drl-Pl). The amino acid residues 59-294 
in NRHK1 's kinase domain have 87.5% sequence identities to smart00219, with a score of 
76.4 bits and an E value of 4 x 10" 15 . 
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[0061] FIGURE 4 illustrates the sequence alignment between amino acid residues 25 to 
285 of NRHK1 and the N-terminal of a Populus x canescens NIMA-related protein kinase 
(Entrez accession number: AF469649_1). The two sequences share 28% sequence 
identities with a score of 87.8 bits and an E value of 5 x 10" 16 . NIMA-related kinases are a 
group of protein kinases sharing high amino acid sequence identities with NIMA (never in 
mitosis gene A) which control mitosis in Aspergillus nidulans. Specifically, NIMA 
regulates mitotic chromatin condensation through phosphorylation of histone H3 at serine 
1 0. NIMA-related kinases may also regulate the cell cycle in other eukaryotes, as 
expression of NIMA can promote mitotic events in yeast, frog or human cells. Moreover, 
dominant-negative versions of NIMA can adversely affect the progression of human cells 
into mitosis. Human Nek6 (hNek6), a member of the mammalian NIMA-related kinases is 
localized to human chromosome 9q33-34, the same region in chromosome 9 where NRHK1 
is located. 

[0062] FIGURE 5 shows the hydrophobicity profile of NRHK1 . The hydrophobicity 
analysis indicates that NRHK1 kinase is not likely a membrane or transmembrane protein. 

[0063] NRHK1 shows significant sequence homology to a human protein kinase-like 
protein SGK071 (Entrez accession number: AX056458, the nucleotide sequence and the 
amino acid sequence of SGK071 are depicted in SEQ ID NOS:5 and 6, respectively), which 
was disclosed in PCT patent application WO00/73469. Analysis using pairwise BLAST 
algorithm revealed that NRHK1 and SGK071 share 84% sequence identities at the amino 
acid level (blastp, matrix: BLOSUM62, gap open: 11, Gap extension: 1, x_dropoff: 50, 
expect: 10.0, wordsize: 3, filter: unchecked), and 90% sequence identities at nucleotide 
level (blastn, match: 1, mismatch: -2, gap open: 5, gap extension: 0, x_dropoff: 50, expect: 
10.0, wordsize: 11, filter: unchecked). 

[0064] The existence and expression of the NRHK1 gene in humans are supported by 
various EST sequences. For instance, nucleotides 1 - 583 of SEQ ID NO:l are supported by 
the EST sequences disclosed under GenBank accession numbers BI458908, AL044935, 
BM015576, AI799141, and Incyte accession number 6854652H1; nucleotides 584 - 866 of 
SEQ ID NO:l are supported by the EST sequences disclosed under GenBank accession 
numbers AV763896, AI936591, AW1 83662, and AI554166; nucleotides 867 - 949 of SEQ 
ID NO:l are supported by the EST sequences disclosed under GenBank accession number 
AW629230 and Incyte accession numbers 6854652Hland 6883792H1; nucleotides 1565 - 
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1707 of SEQ ID NO:l are supported by the EST sequences disclosed under Incyte 
accession number 6140228F8; nucleotides 2426 - 2493 of SEQ ID NO:l are supported by 
the EST sequences disclosed under GenBank accession number BQ 184985. 

Utility of Protein Kinases 

[0065] Protein kinases are involved in the regulation of many critical biological 
processes such as signal transduction pathways. Malfunctions of cellular signaling have 
been associated with many diseases. Regulation of signal transduction by cytokines and 
association of signal molecules with protooncogenes and tumor suppressor genes have been 
the subjects of intense research. Many therapeutic strategies can now be developed through 
the synthesis of compounds which activate or inactivate protein kinases. 
[0066] The importance of kinases in the etiology of diseases has been well established. 
Kinase proteins are a major target for drug action and development. A January 2002 survey 
of ongoing clinical trials in the USA revealed more than 100 clinical trials involving the 
modulation of kinases. Trials are ongoing in a wide variety of therapeutic indications 
including asthma, Parkinson's, inflammation, psoriasis, rheumatoid arthritis, spinal cord 
injuries, muscle conditions, osteoporosis, graft versus host disease, cardiovascular disorders, 
autoimmune disorders, retinal detachment, stroke, epilepsy, ischemia/reperfusion, breast 
cancer, ovarian cancer, glioblastoma, non-Hodgkin's lymphoma, colorectal cancer, non- 
small cell lung cancer, brain cancer, Kaposi's sarcoma, pancreatic cancer, liver cancer, and 
other tumors. Numerous kinds of modulators of kinase activity are currently in clinical 
trials including antisense molecules, antibodies, small molecules, and even gene therapy. 
Accordingly, it is valuable to the field of pharmaceutical development to identify and 
characterize previously unknown members of the kinase family proteins. The present 
invention advances the state of the art by providing previously unidentified human kinase 
proteins which are structurally related to NIMA-related protein kinases. 
[0067] Many therapeutic strategies are aimed at critical components in signal 
transduction pathways. Approaches for regulating kinase gene expression include specific 
antisense oligonucleotides for inhibiting post-transcriptional processing of the messenger 
RNA, naturally occurring products and their chemical derivatives to inhibit kinase activity 
and monoclonal antibodies to inhibit receptor linked kinases. In some cases, kinase 
inhibitors also allow other therapeutic agents additional time to become effective and act 
synergistically with current treatments. 
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[0068] Among the areas of pharmaceutical research that are currently receiving a great 
deal of attention are the role of phosphorylation in transcriptional control, apoptosis, protein 
degradation, nuclear import and export, cytoskeletal regulation, and checkpoint signaling. 
The accumulating knowledge about signaling networks and the proteins involved will be 
put to practical use in the development of potent and specific pharmacological modulators 
of phosphorylation-dependent signaling. The rational structure-based design and 
development of highly specific kinase modulators is becoming routine and drugs that 
intercede in signaling pathways are becoming a major class of drug. The functions of some 
of the kinases are described below. 

[0069] The second messenger dependent protein kinases primarily mediate the effects 
of second messengers such as cyclic AMP (cAMP), cyclic GMP, inositol triphosphate, 
phosphatidylinositol, 3, 4,5 -triphosphate, cyclic-ADPribose, arachidonic acid, diacylglycerol 
and calcium-calmodulin. The cyclic-AMP dependent protein kinases (PKA) are important 
members of the STK family. Cyclic-AMP is an intracellular mediator of hormone action in 
all prokaryotic and animal cells that have been studied. Such hormone-induced cellular 
responses include thyroid hormone secretion, Cortisol secretion, progesterone secretion, 
glycogen breakdown, bone resorption, and regulation of heart rate and force of heart muscle 
contraction. PKA is found in all animal cells and is thought to account for the effects of 
cyclic-AMP in most of these cells. Altered PKA expression is implicated in a variety of 
disorders and diseases including cancer, thyroid disorders, diabetes, atherosclerosis, and 
cardiovascular disease. 

[0070] Calcium-calmodulin (CaM) dependent protein kinases are also members of the 
STK family. Calmodulin is a calcium receptor that mediates many calcium regulated 
processes by binding to target proteins in response to the binding of calcium. The principle 
target protein in these processes is CaM dependent protein kinases. CaM-kinases are 
involved in regulation of smooth muscle contraction (MLC kinase), glycogen breakdown 
(phosphorylase kinase), and neurotransmission (CaM kinase I and CaM kinase II). CaM 
kinase I phosphorylates a variety of substrates including the neurotransmitter related 
proteins synapsin I and II, the gene transcription regulator, CREB, and the cystic fibrosis 
conductance regulator protein, CFTR. CaM II kinase also phosphorylates synapsin at 
different sites, and controls the synthesis of catecholamines in the brain through 
phosphorylation and activation of tyrosine hydroxylase. Many of the CaM kinases are 
activated by phosphorylation in addition to binding to CaM. The kinase may 
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autophosphorylate itself, or be phosphorylated by another kinase as part of a "kinase 
cascade". 

[0071] Another ligand-activated protein kinase is 5'-AMP-activated protein kinase 
(AMPK) Mammalian AMPK is a regulator of fatty acid and sterol synthesis through 
phosphorylation of the enzymes aeetyl-CoA carboxylase and hydroxymethylglutaryl-CoA 
reductase and mediates responses of these pathways to cellular stresses such as heat shock 
and depletion of glucose and ATP. AMPK is a heterotrimeric complex compnsed of a 
catalytic alpha subunit and two non-catalytic beta and gamma subunits that are believed to 
regulate the activity of the alpha subunit. Subunits of AMPK have a much wider 
distribution in non-lipogenic tissues, such as brain, heart, spleen, and lung, than expected. 
This distribution suggests that AMPK's functions may extend beyond regulation of hpid 
metabolism alone. 

[0072] The mitogen-activated protein kinases (MAP) are also members of the STK 
family. MAP kinases also regulate intracellular signaling pathways. They mediate signal 
transduction from the cell surface to the nucleus via phosphorylation cascades. Several 
subgroups have been identified, and each manifests different substrate specificities and 
responds to distinct extracellular stimuli. MAP kinase signaling pathways are present m 
mammalian cells as well as in yeast. The extracellular stimuli that activate mammalian 
pathways include epidermal growth factor (EOF), ultraviolet light, hyperosmolar medium, 
heat shock, endotoxic lipopolysaccharide (LPS), and pro-inflammatory cytokines, such as 
tumor necrosis factor (TNF) and interleukin-1 (IL-1). 

[0073] EGF receptor is found in over half of breast tumors unresponsive to hormone. 
EOF is found in many tumors, and EGF may be required for tumor cell growth. Antibodies 
to EGF blocked the growth of tumor xenografts in mice. An antisense oligonucleotide for 
amphiregulin inhibited growth of a pancreatic cancer cell line. 

[0074] Tamoxifen, a protein kinase C inhibitor with anti-estrogen activity, is currently 
a standard treatment for hormone-dependent breast cancer. The use of this compound may 
increase the risk of developing cancer in other tissues such as the endometrium. Raloxifene, 
a related compound, has been shown to protect against osteoporosis. The tissue specificity 
of inhibitors must be considered when identifying therapeutic targets. 
[0075] Signal transduction to the nucleus in response to extracellular stimulus by a 
growth factor involves the mitogen activated protein (MAP) kinases. MAP kinases are a 
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family of protein serine/threonine kinases which mediate signal transduction from 
extracellular receptors or heat shock, or UV radiation. Cell proliferation and differentiation 
in normal cells are under the regulation and control of multiple MAP kinase cascades. 
Aberrant and deregulated functioning of MAP kinases can initiate and support 
carcinogenesis. Insulin and IGF-1 also activate a mitogenic MAP kinase pathway that may 
be important in acquired insulin resistance occurring in type 2 diabetes. 
[0076] Many cancers become refractory to chemotherapy by developing a survival 
strategy involving the constitutive activation of the phosphatidylinositol 3-kinase-protein 
kinase B/Akt signaling cascade. This survival signaling pathway thus becomes an 
important target for the development of specific inhibitors that would block its function. PI- 
3 kinase/Akt signaling is equally important in diabetes. The pathway activated by RTKs " 
subsequently regulates glycogen synthase 3 (GSK3) and glucose uptake. Since AKT has 
decreased activity in type 2 diabetes, it provides a therapeutic target. 
[0077] Protein kinase inhibitors provide much of our knowledge about in vivo 
regulation and coordination of physiological functions of endogenous peptide inhibitors. A 
pseudosubstrate sequence within PKC acts to inhibit the kinase in the absence of its lipid 
activator. A PKC inhibitor, such as chelerythrine, acts on the catalytic domain to block 
substrate interaction, while calphostin acts on the regulatory domain to mimic the 
pseudosubstrate sequence and block ATPase activity, or to inhibit cofactor binding. 
[0078] Although some protein kinases have, to date, no known system of physiological 
regulation, many are activated or inactivated by autophosphorylation or phosphorylation by 
upstream protein kinases. The regulation of protein kinases also occurs during the 
transcription, post-transcription, and post-translation processes. The mechanism of post- 
transcriptional regulation is alternative splicing of precursor mRNA. For example, protein 
kinase C (31 and (311 are two isoforms of a single PKC P gene derived from differences in the 
splicing of the exon encoding the C-terminal 50-52 amino acids. Splicing can be regulated 
by a kinase cascade in response to peptide hormones, such as insulin and IGF-1 . PKC pi 
and PII have different specificities for phosphorylating members of the mitogen activated 
protein (MAP) kinase family, for glycogen synthase 3 P , for nuclear transcription factors, 
such as TLS/Fus, and for other nuclear kinases. By inhibiting the post-transcriptional 
alternative splicing of PKC pil mRNA, PKC pil-dependent processes are inhibited. 
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[0079] The development of antisense oligonucleotides to inhibit the expression of 
various protein kinases has been successful. Antisense oligonucleotides are short lengths of 
synthetically manufactured, chemically modified DNA or RNA designed to specifically 
interact with mRNA transcripts encoding target proteins. The interaction of the antisense 
moiety with mRNA inhibits protein translation and, in some cases, post-transcriptional 
processing (e.g., alternative splicing and stability) of mRNA. Antisense oligonucleotides 
have been developed to alter alternative splicing of mRNA forms for inhibiting the 
translation of PKCa. 

[0080] Protein kinase C isoforms have been implicated in cellular changes observed in 
the vascular complications of diabetes. Hyperglycemia is associated with increased levels 
of PKCa and 0 isoforms in renal glomeruli of diabetic rats. Oral administration of a PKCP 
inhibitor prevented the increased mRNA expression of TGF-pi and extracellular matrix 
component genes. Administration of the specific PKCP inhibitor (LY333531) also 
normalized levels of cytokines, caldesmon, and hemodynamics of retinal and renal blood 
flow. Overexpression of the PKCp isoform in the myocardium resulted in cardiac 
hypertrophy and failure. The use of LY333531 to prevent adverse effects of cardiac PKCP 
overexpression in diabetic subjects is under investigation. The compound is also in Phase 
I/II clinical trials for diabetic retinopathy and diabetic macular edema indicating that it may 
be pharmacodynamically active. 

[0081] PRK (proliferation-related kinase) is a serum/cytokine inducible STK that is 
involved in regulation of the cell cycle and cell proliferation in human megakaroytic cells. 
PRK is related to the polo (derived from human polo gene) family of STKs implicated in 
cell division. PRK is down-regulated in lung tumor tissue and may be a proto-oncogene 
whose deregulated expression in normal tissue leads to oncogenic transformation. Altered 
MAP kinase expression is implicated in a variety of disease conditions including cancer, 
inflammation, immune disorders, and disorders affecting growth and development. 
[0082] DNA-dependent protein kinase (DNA-PK) is involved in the repair of double- 
strand breaks in mammalian cells. This enzyme requires ends of double stranded DNA or 
transitions from single-stranded to double-stranded DNA in order to act as a 
serine/threonine kinase. Cells with defective or deficient DNA-PK activity are unable to 
repair radiation induced DNA double-strand breaks and are consequently very sensitive to 
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the lethal effects of ionizing radiation. Inhibition of DNA-PK has the potential to increase 
the efficacy of anti-tumor treatment with radiation or chemotherapeutic agents. 
[0083] The cyclin-dependent protein kinases (CDKs) are another group of STKs that 
control the progression of cells through the cell cycle. Cyclins are small regulatory proteins 
that act by binding to and activating CDKs that then trigger various phases of the cell cycle 
by phosphorylating and activating selected proteins involved in the mitotic process. CDKs 
are unique in that they require multiple inputs to become activated. In addition to the 
binding of cyclin, CDK activation requires the phosphorylation of a specific threonine 
residue and the dephosphorylation of a specific tyrosine residue. 

[0084] Cellular inhibitors of CDKs also play a major role in cell cycle progression. 
Alterations in the expression, function, and structure of cyclin and CDK are encountered in 
the cancer phenotype. Therefore, CDKs may be important targets for new cancer 
therapeutic agents. 

[0085] Chemotherapy resistant cells tend to escape apoptosis. Under certain 
circumstances, inappropriate CDK activation may even promote apoptosis by encouraging 
the progression of the cell cycle under unfavorable conditions, i.e., attempting mitosis while 
DNA damage is largely unrepaired. 

[0086] Purines and purine analogs act as CDK inhibitors. Flavopiridol is a flavonoid 
that causes 50% growth inhibition of tumor cells at 60 nM. It also inhibits EGFR and 
protein kinase A. Flavopiridel induces apoptosis and inhibits lymphoid, myeloid, colon, 
and prostate cancer cells grown in vivo as tumor xenografts in nude mice. 
[0087] Staurosporine and its derivative, UCN-01 , in addition to inhibiting protein 
kinase C, inhibit cyclin B/CDK (IC50 = 3 to 6 nM). Staurosporine is toxic, but its 
derivative 7-hydroxystaurosporine (UCN-01) has anti-tumor properties and is in clinical 
trials. UCN-01 affects the phosphorylation of CDKs and alters the cell cycle checkpoint 
functioning. These compounds illustrate that multiple intracellular targets may be affected 
as the concentration of an inhibitor is increased within cells. 

[0088] Protein tyrosine kinases, PTKs, specifically phosphorylate tyrosine residues on 
their target proteins and may be divided into transmembrane, receptor PTKs and non- 
transmembrane, non-receptor PTKs. Transmembrane protein tyrosine kinases are receptors 
for most growth factors. Binding of a growth factor to the receptor activates the transfer of 
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a phosphate group from ATP to selected tyrosine side chains of the receptor and other 
specific proteins. Growth factors (GF) associated with receptor PTKs include; epidermal 
GF, platelet-derived GF, fibroblast GF, hepatocyte GF, insulin and insulin-like GFs, nerve 
GF, vascular endothelial GF, and macrophage colony stimulating factor. 
[0089] Since RTKs stimulate tumor cell proliferation, inhibitors of RTKs may inhibit 
the growth and proliferation of such cancers. Inhibitors of RTKs are also useful in 
preventing tumor angiogenesis and can eliminate support from the host tissue by targeting 
RTKs located on vascular cells, such as blood vessel endothelial cells and stromal 
fibroblasts. For example, VEGF stimulates endothelial cell growth during angiogenesis, 
and increases the permeability of tumor vasculature so that proteins and other growth 
factors become accessible to the tumor. Broad-spectrum antitumor efficacy of an oral 
dosage form of an inhibitor of VEGF signaling has been reported. Thus, inhibition of 
VEGF receptor signaling presents an important therapeutic target. An extracellular receptor 
can also be a target for inhibition. For example, the EGF receptor family and its ligands are 
overexpressed and exist as an autocrine loop in many tumor types. 

10090] Increasing knowledge of the structure and activation mechanism of RTKs and 
the signaling pathways controlled by tyrosine kinases provided the possibility for the 
development of target specific drugs and new anti-cancer therapies. Approaches towards 
the prevention or interception of deregulated RTK signaling include the development of 
selective components that target either the extracellular ligand-binding domain or the 
intracellular substrate binding region. 

[0091] The most successful strategy to selectively kill tumor cells is the use of 
monoclonal antibodies (mAbs) that are directed against the extracellular domain of RTKs, 
which are critically involved in cancer and are expressed at the surface of tumor cells. In 
the past years, recombinant antibody technology has made an enormous progress in the 
design, selection and production of newly engineered antibodies. It is also possible to 
generate humanized antibodies, human-mouse chimeric or bispecific antibodies for targeted 
cancer therapy. Mechanistically, anti-RTK mAbs might work by blocking the ligand- 
receptor interaction and therefore inhibiting ligand-induced RTK signaling and increasing 
RTK down-regulation and internalization. In addition, binding of mAbs to certain epitopes 
on the cancer cells may induce immune-mediated responses, such as opsonization and 
complement-mediated lysis, and trigger antibody-dependent cellular cytotoxicity by 
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macrophages or natural killer cells. In recent years, it became evident that mAbs control 
tumor growth by altering the intracellular signaling pattern inside the targeted tumor cell, 
leading to growth inhibition and/or apoptosis. In addition, bispecific antibodies can bridge 
selected surface molecules on a target cell with receptors on an effector cell, thus triggering 
cytotoxic responses against the target cell. Despite the toxicity that has been seen in clinical 
trials of bispecific antibodies, advances in antibody engineering, characterization of tumor 
antigens and immunology might help to produce rationally designed bispecific antibodies 
for anti-cancer therapy. 

[0092] Another promising approach to inhibiting aberrant RTK signaling is to develop 
small molecule drugs that selectively interfere with the intrinsic tyrosine kinase activity and 
thereby block receptor autophosphorylation and activation of downstream signal 
transducers. The tyrphostins, which belong to the quinazolines, are one important group of 
such inhibitors that compete with ATP for the ATP binding site at the receptor's tyrosine 
kinase domain and some members of the group have been shown to specifically inhibit the 
EGFR. Potent and selective inhibitors of receptors involved in neovascularization have 
been developed and are now undergoing clinical evaluation. New classes of tyrosine kinase 
inhibitors (TKIs) with increased potency and selectivity, higher in vitro and in vivo efficacy 
and decreased toxicity have been developed using the advantages of structure-based drug 
design, crystallographic structure information, combinatorial chemistry and high-throughput 
screening. 

[0093] Recombinant immunotoxins provide another possibility of target-selective drug 
design. Recombinant immunotoxins are composed of a bacterial or plant toxin either fused 
or chemically conjugated to a specific ligand, such as the variable domains of the heavy and 
light chains of mAbs or to a growth factor. Immunotoxins may contain bacterial toxins, 
such as Pseudomouas exotoxin A or diphtheria toxin, or plant toxins, such as ricin A or 
clavin. These recombinant molecules can selectively kill their target cells when internalized 
after binding to cell surface receptors of the target cells. 

[0094] The use of antisense oligonucleotides represents another strategy to inhibit the 
activation of receptor protein kinases (RTKs). Antisense oligonucleotides are short pieces 
of synthetic DNA or RNA that are designed to interact with the mRNA to block the 
transcription and thus the expression of the target proteins. Antisense oligonucleotides 
interact with the mRNA by Watson-Crick base-pairing and are therefore highly specific to 
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the target protein. Several preclinical and clinical studies suggest that antisense therapy 
might be therapeutically useful for the treatment of solid tumors. 
[00951 The potential of RTKs and their relevant signaling as selective anti-cancer 
targets for therapeutic intervention has been recognized. As a consequence, a variety of 
successful target specific drugs such as mAbs and RTK inhibitors have been developed and 
are currently being evaluated in clinical trials. Table 3 summarizes the most successful 
drugs against receptor tyrosine kinase signaling which are currently evaluated in cluneal 
phases or have already been approved by the FDA. 

T? uu 3 RTK Drugs Currently Under Clinical Evaluation 



RTK 


Drug 


Company 


Description 


Status 


EGFR 




AstraZeneca 


TKI that inhibits EGFR 
signaling 


Phase III 


EGFR 


Cetuximab C225 


ImClone 
Systems 


Mab directed against EGFR 


Phase III 


EGFR 


EGF fusion protein 


Seragen 


Recombinant diphtheria 
toxin-hEGF fusion protein 


Phase II 


HER2 


Trastuzumab 
Herceptin 


Genetech 


Mab directed against HER2 


Approved 
by the FDA 
in 1998 


IGF-IR 


INX-4437 


INEX USA 


Antisense oligonucleotides 
targeting IGR-IR 


Phase I 


VEGFR 


SU5416 


SUGEN 


TKI that inhibits VEGFR2 


Phase II 


VEGFR/ 
FGFR/ 
1 PDGFR 


SU6668 


SUGEN 


RTK inhibition of VEGFR, 
FGFR, and PDGFR 


Phase I 



[0096] Non-receptor PTKs lack transmembrane regions and, instead, form complexes 
with the intracellular regions of cell surface receptors. Receptors that function through non- 
receptor PTKs include those for cytokines, hormones (growth hormone and prolactm) and 
antigen-specific receptors on T and B lymphocytes. 

[0097] Many of the PTKs were first identified as the products of mutant oncogenes in 
cancer cells where their activation was no longer subject to normal cellular controls. In fact, 
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about one third of the known oncogenes encode PTKs, and it is well known that cellular 
transformation (oncogenesis) is often accompanied by increased tyrosine phosphorylation 
activity. 

[0098] Many tyrosine kinase inhibitors, such as flavopiridol, genistem, erbstatin, 
lavendustin A, staurosporine, and UCN-01, are derived from natural products. Inhibitors 
directed to the ATP binding site are also available. Signals from RTKs can also be inhibited 
at other target sites such as nuclear tyrosine kinases, membrane anchors (inhibition of 
farnesylation) and transcription factors. 

[0099] Targeting the signaling potential of growth promoting tyrosine kinases such as 
EGFR, HER2, PDGFR, src, and abl, will block tumor growth while blocking IGF-I and 
TRK will interfere with tumor cell survival. Inhibition of these kinases will lead to tumor 
shrinkage and apoptosis. Fkll/KDR and src are kinases necessary for neovascularization 
(angiogenesis) of tumors. Inhibition of these kinases will slow tumor growth and decrease 
metastases. 

[0100] Inhibitors of RTKs suppress tumor development by preventing cell migration, 
invasion and metastases. These drugs are likely to increase the time required for tumor 
progression, and may inhibit or attenuate the aggressiveness of the disease but may not 
initially result in measurable tumor regression. 

[0101] An example of cancer arising from a defective tyrosine kinase is a class of ALK 
positive lymphomas referred to as "ALKomas" which display inappropriate expression of a 
neural-specific tyrosine kinase, anaplastic lymphoma kinase (ALK). 
[0102] Iressa (ZD1839) is an orally active selective EGF-R inhibitor. This compound 
disrupts signaling involved in cancer cell proliferation. The clinical efficacy of this agent 
shows that it is well tolerated by patients undergoing Phase I/II clinical trials. The 
compound has shown promising cytotoxicity towards several cancer cell lines. 
[0103] Since the majority of protein kinases are expressed in the brain, often in a 
neuron-specific fashion, protein phosphorylation must play a key role in the development 
and function of the vertebrate central nervous system. Thus neuron-specific kinases are 
well established as targets for the development of pharmacologically active modulators. 
[0104] NIMA-related kinases are a group of protein kinases sharing high amino acid 
sequence identities with NIMA (never in mitosis gene A), a protein kinase that, in 
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corporation with P 34Cdc2 protein kinase, regulate mitosis in Aspergillus nidulans. Both 
P 34cdc2/cyclin B and NIMA have to be correctly activated before mitosis can be initiated in 
this species, and P 34cdc2/cyclin B plays a role in the mitosis-specific activation of NIMA. 
In addition, both kinases have to be proteolytically destroyed before mitosis can be 
completed. NIMA regulates mitotic chromatin condensation through phosphorylation of 
histone H3 at serine 10. NIMA-related kinases may also regulate the cell cycle in other 
eukaryotes, as expression of NIMA can promote mitotic events in yeast, frog or human 
cells. In Xenopus oocytes, NIMA induces germinal vesicle breakdown without activating 
Mos, CDC2, or MAP kinase. In HeLa cells, dominant-negative versions of NIMA 
adversely affects the progression into mitosis by causing a specific G2 arrest. 
[0105] Mutation of threonine 199 (conserved in all NIMA-related kinases) inhibited 
NIMA beta-casein kinase activity and abolished its in vivo function. This site conforms to a 
minimal consensus phosphorylation site for NIMA (FXXT) and is analogous to the 
autophosphorylation site of cyclic-AMP-dependent protein kinases. Mutations in an 
NIMA-related kinase gene, Nekl , cause pleiotropic effects including a progressive 
polycystic kidney disease in mice. 

[0106] Within the human NIMA-related kinase (Nek) family, the Nek2 protein kinase is 
the closest known mammalian relative of the NIMA of Aspergillus nidulans. The two 
kinases share 47% sequence identity over their catalytic domains and display a similar cell 
cycle-dependent expression peaking at the G2 to M phase transition. It was found that 
recombinant Nek2 is active as a serine/threonine protein kinase and may undergo 
autophosphorylation. Both human Nek2 and fungal NIMA phosphorylate a similar, albeit 
not identical, set of proteins and synthetic peptides, and beta-casein was found to be a 
suitable substrate for assaying Nek2 in vitro. In HeLa cells, the Nek2 activity parallels its 
abundance, being low during M and Gl but high during S and G2 phase. Human Nek2 
localizes to the centrosome. Overexpression of active Nek2 induces a striking splitting of 
centrosomes, whereas prolonged expression of either active or inactive Nek2 leads to 
dispersal of centrosomal material and loss of a focused microtubule-nucleating activity. 
These results indicate that one function of human Nek2 relates to the centrosome cycle. 
[0107] Human Nek3 was cloned by RT-PCR. A high level of Nek3 expression was 
detected in testis, ovary, and brain, with low-level expression being detected in most of the 
other tissues. Nek3 mRNA was detected in all the proliferating cell lines studied, and the 
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amount did not change during the cell cycle. The human Nek3 gene was assigned to human 
chromosome 13 by somatic cell hybrids and 13ql4.2 by radiation hybrid mapping. 
[0108] Human Nek6 is localized to human chromosome 9q33-34, the same region 
where NRHK1 is located. Nek6 transcripts appear to be expressed in most of the tissues. 
[0109] Human Nek7 is 77% identical to human Nek6. Phylogenetic analysis suggests 
that Nek6, Nek7, and C. elegans F19H6.1 constitute a subfamily within the NIMA family of 
protein kinases. Nek7 expression was restricted to a subset of tissues containing lung, 
muscle, testis, brain, heart, liver, leukocyte, and spleen. The human NEK7 gene was 
assigned to human chromosome 1 by somatic cell hybrids and 1 q3 1 .3 by radiation hybrid 
mapping. 

[0110] Recently, human Nek8, a new human NIMA-related kinase, and its candidate 
substrate Bicd2 were isolated and cloned. Nek8 contains an N-terminal catalytic domain 
homologous to the Nek family of protein kinases, a central domain with homology to 
RCC1, a guanine nucleotide exchange factor for the GTPase Ran, and a C-terminal coiled- 
coil domain. Like Nek2, Nek8 prefers beta-casein over other exogenous substrates, has 
shared biochemical requirements for kinase activity, and is capable of autophosphorylation 
and oligomerization. Nek8 activity is not cell cycle regulated, but like Nek3, levels are 
consistently higher in G(0)-arrested cells. Bicd2 is a protein co-chromatographed with 
Nek8 activity. This protein is a human homolog of the Drosophila protein Bicaudal D, a 
coiled-coil protein. Bicd2 is phosphorylated by Nek8 in vitro, and the endogenous proteins 
associate in vivo. Bicd2 localizes to cytoskeletal structures, and its subcellular localization 
is dependent on microtubule morphology. Treatment of cells with nocodazole leads to 
dramatic reorganization of Bicd2, and correlates with Nek8 phosphorylation. It thus 
appears that both Nek8 and Bicd2 associated with cell cycle independent microtubule 
dynamics. 

[0111] In summary, kinase proteins are a major target for drug action and development. 
Accordingly, it is valuable to the field of pharmaceutical development to identify and 
characterize previously unknown members of kinase proteins. The present invention 
advances the state of the art by providing a previously unidentified human kinase protein 
that has sequence and structure similarities to the catalytic domain of protein kinases. 
Specifically, the kinase domain in NRHK1 shares high sequence identities with the 
corresponding domains in serine/threonine kinases, pkinases, and a family of tyrosine 
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kinases The kinase domain in NRHK1 is also distantly related to the sequences of NIMA- 
related kinases. Functional studies in diverse species have implicated NIMA-related 
kinases in G(2)/M progression, chromatin condensation and centrosome regulate. The 
kinase domain in NRHK1 , either in its native form or in a mutant form, can be used to 
affect the function of the corresponding domain in other kinases. This domain can also be 
used to phosphorylate suitable substrates. The substrate peptides can be conjugated to 
antibodies, and the phosphate groups added to the substrate peptides can be radioactrvely or 
fluorescently labeled. Antibodies thus labeled can be used in various detection assays, as 
appreciated by one of skill in the art. 

[0112] NRHK1 gene and the gene product can be used as a molecular marker for 
diagnosing, prognosing, and monitoring the treatment of disorders related to the aberrant 
expression of NRHK1 . In addition, the NRHK1 gene can be used to screen for potential 
agents or drugs capable of enhancing or inhibiting the NRHK1 gene expression in human 
cells The NRHK1 gene products (polynucleotide and polypeptide) can be used to screen 
for potential agents or drugs capable of enhancing or inhibiting NRHK1 activity. 
Furthermore, various therapeutic methods for treating disorders related to the aberrant 
expression of NRHK1 can be designed based on the NRHK1 gene, its variants, or the 
agents/drugs that affect the expression of the NRHK1 gene or the activity of the NRHK1 
gene products. 

[0113] The following subsections illustrate examples of the utilities of human NRHK1 
gene and NRHK1 kinase. Various changes and modifications within the spirit and scope of 
the present invention will become apparent to those skilled in the art from the present 
description. 

Polynucleotides and Variants Thereof 

[0114] One aspect of the invention pertains to isolated polynucleotide probes capable of 
hybridizing to the NRHK1 gene or its transcripts, such as NRHK1 mRNAs. These probes 
can be used to detect the expression level of the NRHK1 gene in human tissue or cells. The 
present invention also contemplates polynucleotide fragments for use as PGR pnmers for 
the amplification or mutation of the NRHK1 gene or the NRHK1 kinase-coding sequences. 
Another aspect of the invention pertains to isolated polynucleotides that encode NRHK1 , or 
a fragment or mutant thereof. These polynucleotides can be used for expressing NRHK1 , or 
a fragment or mutant thereof. The protein products thus expressed can be used to screen for 
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agents/drugs that modulate an activity of NRHK1 . In addition, these polynucleotides can be 
used to designing gene therapy vectors which target the expression of the NRHK1 gene or 
an activity of NRHK1 in humans. 

[0115] A polynucleotide comprising SEQ ID NO:l or SEQ ID NO:3 can be prepared 
using standard molecular biology techniques as appreciated by one of ordinary skill in the 
art For instance, primers derived from the 5' and 3' ends of SEQ ID NO:l can be used to 
amplify mRNAs isolated from human tissues. The cDNA thus produced contains SEQ ID 
NO:l . Likewise, primers for amplifying the human genomic sequence containing SEQ ID 
NO-3 can be designed and used to prepare the genomic sequence of the NRHK1 gene. A 
variant (such as a homolog) or a fragment of SEQ ID NO:l or SEQ ID NO:3 can be 
similarly prepared. Alternatively, probes can be designed to screen for cDNA or genomic 
sequence libraries in order to identify polynucleotide molecules comprising the full-length 
or fragments of SEQ ID NO:l or SEQ ID NO:3. The molecules thus identified can be used 
to create suitable vectors comprising the full-length SEQ ID NO:l or SEQ ID NO:3. 
[0116] Polynucleotides capable of hybridizing to the NRHK1 gene can be prepared by 
standard synthetic techniques, e.g., using an automated DNA synthesize, Preferably, the 
polynucleotide probes can hybridize to the NRHK1 gene under reduced stringent 
conditions, stringent conditions, or highly stringent conditions. In one embodiment, the 
polynucleotides comprise at least 15, 20, 25, 30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 
500 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000 or more consecutive nucleotides of 
SEQ ID NO:l. Any fragments of SEQ ID NO: 1 and SEQ ID NO: 3 may be used as 
hybridization probes or PCR primers for the NRHK1 gene or its transcripts. The 
probes/primers can be substantially purified. 

[0117] In a preferred embodiment, the hybridization probes for the NRHK1 gene 
comprise a label group. The label group can be a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co-factor. Probes thus labeled can be used as part of a diagnostic kit 
for determining the expression level of the NRHK1 gene in human tissues. 
[0118] This invention encompasses human NRHK1 gene homologs in other species. 
These homologs can be determined by search different sequence databases, such as the 
Entrez/GenBank sequence databases maintained by the NCBI. The invention also 
encompasses polynucleotide molecules which are structurally different from the molecules 
described above, but have the substantially same properties as the molecules described 
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above. Such molecules include allelic variants, which will be described below in greater 
detail. 

[0119] DNA sequence polymorphism in human NRHKl gene exists among different 
individuals due to natural allelic variations. An allele is one of a group of genes which 
occur alternatively at a given genetic locus. DNA polymorphisms that affect the RNA 
expression level of the NRHKl gene can also exist, e.g., through affecting the regulation or 
degradation of expression of the gene. The present invention contemplates all allelic 
variants of human NRHKl gene. Allelic variants and other homologs of the NRHKl gene 
can be isolated using probes/primers derived from SEQ ID NO: 1 or SEQ ID NO:3. 
[0120] It should, of course, be understood that SEQ ID NO: 1 and SEQ ID NO:3 can be 
modified. The modified polynucleotides can comprise one or more mutations. These 
mutations can be substitutions, additions or deletions of 1, 2, 3, 5, 10, 15, 20 or more 
nucleotide residues in SEQ ID NO:l or SEQ ID NO:3. Standard techniques can be used, 
such as site-directed mutagenesis or PCR-mediated mutagenesis. Preferably, these 
mutations create conservative amino acid substitutions. Alternatively, mutations can be 
introduced randomly along all or part of the NRHKl gene or its cDNA, such as by 
saturation mutagenesis. Following mutagenesis, the encoded proteins can be expressed 
recombinantly and their activities can be determined. 

[0121] In one embodiment, nucleotide substitutions leading to amino acid substitutions at 
"non-essential" amino acid residues can be introduced. A "non-essential" amino acid 
residue is a residue that can be altered without changing the biological activity of the 
protein. In contrast, an "essential" amino acid residue is required for the biological activity 
of the protein. Amino acid residues that are conserved among allelic variants or homologs 
of the NRHKl gene from different species preferably are not changed in the present 
invention. 

[0122] Accordingly, another aspect of the invention pertains to NRHKl proteins that 
contain changes in amino acid residues that are not essential for the biological activity of 
NRHKl . These proteins differ in amino acid sequence from the original human NRHKl 
kinase, but retain its biological activity. In one embodiment, the modified protein 
comprises an amino acid sequence at least about 85%, 90%, 95%, 98% or more homologous 
to SEQ ID NO:2. 
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[0123] In another embodiment, NRHK1 proteins contain mutations in amino acid residues 
which result in inhibition of NRHK1 activity. These mutated NRHK1 proteins can be used 
to inhibit NRHK1 activity in patients with disorders related to the aberrant expression of 
NRHK1. 

[0124] A polynucleotide of this invention can be further modified to increase its stability 
in vivo. Possible modifications include, but are not limited to, the addition of flanking 
sequences at the 5'and/or 3'ends; the use of phosphorothioate or 2-o-methyl rather than 
phosphodiester linkages in the backbone; and/or the inclusion of nontraditional bases such 
as inosine, queosine and wybutosine, as well as acetyl- methyl-, thio- and other modified 
forms of adenine, cytidine, guanine, thymine and uridine. 

[0125] Polynucleotide molecules which are antisense to the NRHK1 gene can be 
prepared. An "antisense" polynucleotide comprises a nucleotide sequence which is 
complementary to a "sense" polynucleotide which encodes a protein. An antisense 
polynucleotide can bind via hydrogen bonds to the sense polynucleotide. 
[0126] Antisense polynucleotides of the invention can be designed according to the rules 
of Watson and Crick base pairing. The antisense polynucleotide molecule can be 
complementary to the entire coding region or part of the coding region of the NRHK1 gene. 
The antisense polynucleotide molecule can also be complementary to a "noncoding region" 
in the coding strand of the NRHK1 gene. Preferably, the antisense polynucleotide is an 
oligonucleotide which is antisense to only a portion of the NRHK1 gene. An antisense 
polynucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides 
in length. An antisense polynucleotide of the invention can be constructed using chemical 
synthesis and enzymatic ligation reactions as appreciated by one of ordinary skill in the art. 
For example, an antisense polynucleotide can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical stability of the duplex formed between 
the antisense and sense polynucleotides. Examples of modified nucleotides which can be 
used to generate the antisense polynucleotide include 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 
5-(carboxyhydroxymethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
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2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine,5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyl adenosine, unacil-5-oxyacetic acid, wybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Phosphorothioate derivatives and acridine substituted nucleotides can 
also be used. Alternatively, the antisense polynucleotide can be produced biologically using 
an expression vector into which a polynucleotide has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted polynucleotide will be of an antisense 
orientation to the target polynucleotide of interest). 

[0127] The antisense polynucleotides of the invention can be administered to a subject or 
applied in situ such that they hybridize or bind to cellular mRNAs and/or genomic DNAs 
that encode NRHK1 kinase, thereby inhibiting the expression of NRHK1 kinase. The 
hybridization can result in a stable duplex via conventional nucleotide complementarity. An 
example route for administering antisense polynucleotides includes direct injection at a 
tissue site. Antisense polynucleotides can also be modified first, and then administered 
systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a selected 
cell surface. Suitable modifications include linking the antisense polynucleotides to 
peptides or antibodies which bind to the cell surface receptors or antigens. In addition, the 
antisense polynucleotides can be delivered to cells using vectors. To achieve sufficient 
intracellular concentrations of the antisense molecules, strong pol II or pol III promoters 
may be used in the vectors. 

[0128] In one embodiment, the antisense polynucleotides are a-anomeric polynucleotides. 
An a-anomeric polynucleotide molecule forms specific double-stranded hybrid with a 
complementary RNA in which, contrary to the usual p-units, the strands run parallel to each 
other. The antisense polynucleotide molecule can also comprise a 2-o-methylribonucleotide 
or a chimeric RNA-DNA analogue. 

[0129] In another embodiment, the antisense polynucleotide is a ribozyme. Ribozymes 
are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a 
single-stranded polynucleotide, such as an mRNA, to which they have a complementary 
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region. Thus, ribozy^es (e.g., hammerhead ribozymes deseribed in Hase.hoif and Gerlach 
Nature 334:585-59!, 1988) ean be used to eatalytieally cleave mRNA transcripts of NRHK1 
in order to inhibit its expression. A ribozyme having specificity for the NRHK1 gene or Us 
transcripts can be designed based upon SEQ ID NO: 1 or 3. mRNAs transcribed from the 
NRHK1 gene ean be used to select from a pool of RNA molecules a catalytic RNA havmg a 
specific ribonuclease activity. 

[0130] Alternatively, the expression of the NRHK1 gene can be inhibited by using 
nucleotide sequences complementary to the regulatory region (e.g., the promoter and/or 
enhancers). These nucleotide sequences can form triple helical structures that prevent 
transcription of the gene in the target cells. 

[•131] Expression of the NRHK1 gene can also be inhibited using RNA interference 
("RNAi") RNAi is a phenomenon in which the introduction of double-stranded RNA 
(dsRNA) into certain organisms or cell types causes degradation of the homologous mRNA. 
First discovered in the nematode Caenorhabditis elegans, RNAi has since been found to 
operate in a wide range of organisms. For example, in mammalian cells, introduce of 
long dsRNA (>30 nt) can initiate a potent antiviral response, exemplified by nonspecific 
inhibition of protein synthesis and RNA degradation. RNA interference provides a 
mechanism of gene silencing at the mRNA level. In recent years, RNAi has become an 
endogenous and potent gene-specific silencing technique that uses double-stranded RNAs 
(dsRNA) to mark a particular transcript for degradation in vivo. It also offers an efficient 
and broadly applicable approach for gene knock-out. In addition, RNAi technology can be 
used for therapeutic purposes. For example, RNAi targeting Fas-mediated apoptosis has 
been shown to protect mice from fulminant hepatitis. RNAi technology has been disclosed 
in numerous publications, such as U.S. Patent Nos. 5,919,619, 6,506,559 and PCT 
Publication Nos. WG99/14346, WO01/70949, WO01/36646, WO00/63364, WOOO/44895, 
WO01/75164, WO01/92513, WO01/68836 and WO01/29058. 

[0132] In a preferred embodiment, short interfering RNAs (siRNA) are used. siRNAs are 
dsRNAs having 19-25 nucleotides. siRNAs can be produced endogenously by degradation 
of longer dsRNA molecules by an RNase Ill-related nuclease called Dicer. siRNAs can 
also be introduced into a cell exogenously or by transcription of an expression construct. 
Once formed, the siRNAs assemble with protein components into endonbonuclease- 
containing complexes known as RNA-induced silencing complexes (RISCs). An ATP- 
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generated unwinding of the siRNA activates the RISCs, which in turn target the 
complementary mRNA transcript by Watson-Crick base-pairing, thereby cleaving and 
destroying the mRNA. Cleavage of the mRNA takes place near the middle of the region 
bound by the siRNA strand. This sequence specific mRNA degradation results in gene 
silencing. 

[0133] At least two ways can be employed to achieve siRNA-mediated gene silencing. 
First, siRNAs can be synthesized in vitro and introduced into cells to transiently suppress 
gene expression. Synthetic siRNA provides an easy and efficient way to achieve RNAi. 
siRNA are duplexes of short mixed oligonucleotides which can include, for example, 19 
nucleotides with symmetric 2 dinucleotide 3'overhangs. Using synthetic 21 bp siRNA 
duplexes (19 RNA bases followed by a UU or dTdT 3' overhang), sequence specific gene 
silencing can be achieved in mammalian cells. These siRNAs can specifically suppress 
targeted gene translation in mammalian cells without activation of DNA-dependent protein 
kinase (PKR) by longer dsRNA, which may result in non-specific repression of translation 
of many proteins. 

[0134] Second, siRNAs can be expressed in vivo from vectors. This approach can be 
used to stably express siRNAs in cells or transgenic animals. In one embodiment, siRNA 
expression vectors are engineered to drive siRNA transcription from polymerase III (pol III) 
transcription units. Pol III transcription units are suitable for hairpin siRNA expression, 
since they deploy a short AT rich transcription termination site that leads to the addition of 
2 bp overhangs (UU) to hairpin siRNAs - a feature that is helpful for siRNA function. The 
Pol III expression vectors can also be used to create transgenic mice that express siRNA. 
[0135] In another embodiment, siRNAs can be expressed in a tissue-specific manner. 
Under this approach, long double-stranded RN As (dsRNAs) are first expressed from a 
promoter (such as CMV (pol II)) in the nuclei of selected cell lines or transgenic mice. The 
long dsRNAs are processed into siRNAs in the nuclei (e.g., by Dicer). The siRNAs exit 
from the nuclei and mediate gene-specific silencing. A similar approach can be used in 
conjunction with tissue-specific (pol II) promoters to create tissue-specific knockdown 



mice. 



[0136] Any 3 'dinucleotide overhang, such as UU, can be used for siRNAs. In some 
eases, G residues in the overhang may be avoided because of the potential for the siRNA to 
be cleaved by RNase at single-stranded G residues. 
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[01371 With regard to the siRNA sequence itself, it has been found that siRNAs with 30- 
50% GC content can be more active than those with a higher G/C content in certain cases. 
Moreover, since a 4-6 nucleotide poly(T) tract may act as a termination signal for RNA pol 
III, stretches of > 4 Ts or As in the target sequence may be avoided in certain cases when 
designing sequences to be expressed from an RNA pol III promoter. In addition, some 
regions of mRNA may be either highly structured or bound by regulatory proteins. Thus, it 
may be helpful to select siRNA target sites at different positions along the length of the 
gene sequence. Finally, the potential target sites can be compared to the appropriate 
genome database (human, mouse, rat, etc.). Any target sequences with more than 16-17 
contiguous base pairs of homology to other coding sequences may be eliminated from 
consideration in certain cases. 

[0138] In one embodiment, siRNA can be designed to have two inverted repeats separated 
by a short spacer sequence and end with a string of Ts that serve as a transcription 
termination site. This design produces an RNA transcript that is predicted to fold into a 
short hairpin siRNA. The selection of siRNA target sequence, the length of the inverted 
repeats that encode the stem of a putative hairpin, the order of the inverted repeats, the 
length and composition of the spacer sequence that encodes the loop of the hairpin, and the 
presence or absence of S'-overhangs, can vary to achieve desirable results. 
[0139] The siRNA targets can be selected by scanning an mRNA sequence for AA 
dinucleotides and recording the 19 nucleotides immediately downstream of the AA. Other 
methods can also been used to select the siRNA targets. In one example, the selection of 
the siRNA target sequence is purely empirically determined (see e.g., Sui et al., Proc. Natl. 
Acad. Sci. USA 99: 5515-5520, 2002), as long as the target sequence starts with GG and 
does not share significant sequence homology with other genes as analyzed by BLAST 
search. In another example, a more elaborate method is employed to select the siRNA 
target sequences. This procedure exploits an observation that any accessible site in 
endogenous mRNA can be targeted for degradation by synthetic oligodeoxyribonucleotide 
/RNase H method (Lee et al., Nature Biotechnology 20:500-505, 2002). 
[0140] In another embodiment, the hairpin siRNA expression cassette is constructed to 
contain the sense strand of the target, followed by a short spacer, the antisense strand of the 
target, and 5-6 Ts as transcription terminator. The order of the sense and antisense strands 
within the siRNA expression constructs can be altered without affecting the gene silencing 
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activities of the hairpin siRNA. In certain instances, the reversal of the order may cause 
partial reduction in gene silencing activities. 

[0141] The length of nucleotide sequence being used as the stem of siRNA expression 
cassette can range, for instance, from 19 to 29. The loop size can range from 3 to 23 
nucleotides. Other lengths and/or loop sizes can also be used. 

[0142] In yet another embodiment, a 5'overhang in the hairpin siRNA construct can be 
used, provided that the hairpin siRNA is functional in gene silencing. In one specific 
example, the 5 'overhang includes about 6 nucleotide residues. 

[0143] In a preferred embodiment, the target sequence for RNAi is a 21-mer sequence 
fragment selected from SEQ ID NO:l. The 5' end of the target sequence has dinucleotide 
"NA," where "N" can be any base and "A" represents adenine. The remaining 19-mer 
sequence has a GC content of between 45% and 55%. In addition, the remaining 19-mer 
sequence does not include (1) any three consecutive identical bases (i.e., GGG, CCC, TTT, 
or AAA); (2) seven "GC" in a role; and (3) any palindrome sequence with 5 or more bases. 
Furthermore, the target sequence has low sequence homology to other human genes. In one 
specific example, potential target sequences are searched by BLASTN against NCBI's 
human UniGene cluster sequence database. The human UniGene database contains non- 
redundant sets of gene-oriented clusters. Each UniGene cluster includes sequences that 
represent a unique gene. Fragments of SEQ ID NO: 1 that produce no hit to other human 
genes under BLASTN search are selected as the preferred candidate sequences for RNAi. 
During the search, the e-value may be set at a stringent value (such as "1"). Table 4 lists 
exemplary NRHK1 gene target sequences for RNAi prepared using the above-described 
criteria. The siRNA sequences for each target sequence (the sense strand and the antisense 
strand) are also disclosed. In addition, the 5' end location of each target sequence in SEQ 
ID NO:l is identified ("5 End"). 
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[0144] In yet another embodiment, the polynucleotides of the present invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve the stability, 
hybridization, or solubility of the molecules. For instance, the deoxyribose phosphate 
backbone of the polynucleotide molecules can be modified to generate peptide 
polynucleotides (see Hyrup B. et al. Bioorganic & Medicinal Chemistry 4:523, 1996). As 
used herein, the terms "peptide polynucleotides" or "PNAs" refer to polynucleotide mimics, 
e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. PNA oligomers can be synthesized using standard 
solid phase peptide synthesis protocols. 

[0145] PNAs can be used in therapeutic and diagnostic applications. For example, PNAs 
can be used as antisense agents for sequence-specific modulation of the NRHK1 gene 
expression. PNAs can also be used in the analysis of single base pair mutations in a gene, 
(e.g., by PNA-directed PCR clamping); as artificial restriction enzymes when used in 
combination with other enzymes, (e.g., SI nucleases); or as probes or primers for DNA 
sequencing or hybridization. 

[0146] In one embodiment, PNAs can be modified to enhance their stability or cellular 
uptake by attaching lipophilic or other helper groups to PNA, by the formation of PNA- 
DNA chimeras, or by the use of liposomes or other drug delivery techniques known in the 
art. For example, PNA-DNA chimeras of the polynucleotides of the invention can be 
generated. These chimeras allow DNA recognition enzymes, such as RNase H and DNA 
polymerases, to interact with the DNA portion while the PNA portion provides high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths which are selected based on base stacking, number of bonds between the 
nucleobases, and orientations. The PNA-DNA chimeras can be synthesized as follows. A 
DNA chain is synthesized on a solid support using standard phosphoramidite coupling 
chemistry and modified nucleoside analogs. PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5 1 PNA segment and a 3' DNA segment. 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA 
segment. 
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[0147] In other embodiments, the polynucleotides of this invention may include other 
appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents 
facilitating transportation across the cell membrane or the blood-kidney barrier (see, e.g., 
PCT Publication No. W089/1 0 1 34). In addition, polynucleotides can be modified using 
hybridization-triggered cleavage agents or intercalating agents. To this end, the 
polynucleotides can be conjugated to another molecule (e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 
Furthermore, the polynucleotide can be detectably labeled. 
Polypeptides and Va riants Thereof 

[0148] Several aspects of the invention pertain to isolated NRHK1 polypeptides and 
mutated NRHK.1 polypeptides capable of inhibiting normal NRHK1 activity. The present 
invention also contemplates immunogenic polypeptide fragments suitable for raising anti- 
NRHK1 antibodies. 

[0149] In one embodiment, native NRHK1 polypeptides can be isolated from cells or 
tissue sources by using standard protein purification techniques. Standard purification 
methods include electrophoresis, molecular, immunological and chromatographic 
techniques. Specific examples include ion exchange, hydrophobic, affinity or reverse-phase 
HPLC chromatography, and chromatofocusing. In one embodiment, NRHK1 polypeptides 
are purified using a standard affinity column coupled with anti-NRHKl antibodies. 
Ultrafiltration and diafiltration techniques can also be used. The degree of purification 
depends on the purpose of the use of the NRHK1 polypeptides. In some instances, 
purification is not necessary. 

[0150] In another embodiment, NRHK1 polypeptides or mutated NRHK1 polypeptides 
capable of inhibiting normal NRHK1 activity are produced by recombinant DNA 
techniques. Alternative to recombinant expression, NRHK1 polypeptides or mutated 
NRHK1 polypeptides can be synthesized chemically using standard peptide synthesis 
techniques. 

[0151] The invention provides NRHK1 polypeptides encoded by the human NRHK1 
gene, or homologs thereof. The polypeptides of this invention can be substantially 
homologous to human NRHK1 kinase (SEQ ID NO:2). Preferably, these polypeptides 
retain the biological activity of the native NRHK.1 kinase. In one embodiment, the 
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polypeptides comprise an amino acid sequence which is at least about 85%, 90%, 95%, 98% 
or more homologous to SEQ ID NO:2. 

[0152] Comparison of sequences and determination of percent identity between two 
sequences can be accomplished using a mathematical algorithm. The percent identity 
between two amino acid sequences can be determined using the Needleman and Wunsch (J. 
Mol. Biol. 48:444-453, 1970) algorithm, or the GAP program in the GCG software package 
which uses either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
10, 8, 6, or 4 and a length weight of 1 , 2, 3, 4, 5, or 6. The percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software 
package, which uses a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 
and a length weight of 1 , 2, 3, 4, 5, or 6. The percent identity between two amino acid or 
nucleotide sequences can also be determined using the algorithm of E. Meyers and W. 
Miller (CABIOS, 4:1 1-17, 1989) which has been incorporated into the ALIGN program 
(version 2.0), or the pairwise BLAST program available at NCBI's BLAST web site. 

[0153] The polypeptide and polynucleotide sequences of the present invention can be 
used as query sequences for searching public databases in order to identify similar 
sequences. The search can be conducted using BLAST programs, such as the protein 
BLAST, nucleotide BLAST, pairwise BLAST, and genomic BLAST, that are available at 
the BLAST web site maintained by the NCBI. When using BLAST programs, the default 
parameters of the respective programs can also be used. 

[0154] The invention further provides chimeric or fusion NRHK1 polypeptides. A fusion 
NRHK1 polypeptide contains an NRHK1 -related polypeptide and a non-NRHKl 
polypeptide. The NRHK1 -related polypeptides include all or a portion of SEQ ID NO:2 or 
its variant. A peptide linker sequence can be employed to separate the NRHK1 -related 
polypeptide from the non-NRHKl polypeptide components by a distance sufficient to 
ensure that each polypeptide folds into its native secondary and tertiary structures. Such a 
peptide linker sequence is incorporated into the fusion protein using standard techniques 
well known in the art. Suitable peptide linker sequences can be chosen based on the 
following factors: (1) their ability to adopt a flexible extended conformation; (2) their 
inability to adopt a secondary structure that could interact with functional epitopes on the 
NRHK1 -related polypeptide and non-NRHKl polypeptide; and (3) the lack of hydrophobic 
or charged residues that might react with the polypeptide functional epitopes. Preferred 
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peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala can also be used in the linker sequence. Amino acid sequences suitable 
as linkers include those disclosed in Maratea et ah, Gene 40:39-46, 1985; Murphy et al., 
Proc. Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Pat. No. 4,935,233 and U.S. Pat No. 
4,751,180. The linker sequences may be from 1 to about 50 amino acids in length. Linker 
sequences are not required when the NRHK1 -related polypeptide or the non-NRHKl 
polypeptide has non-essential N-terminal amino acid regions that can be used to separate the 
respective functional domains and thereby prevent steric interference. 

[0155] In one embodiment, the fusion protein is a GST-NRHK1 fusion protein in which 
an NRHK1 -related sequence, such as SEQ ID NO:2, is fused to the C-terminus of the GST 
sequence. This fusion protein can facilitate the purification of the recombinant NRHK1 . 

[0156] The NRHK1 -fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject. The NRHK1 -fusion proteins 
can be used to affect the bioavailability of an NRHK1 substrate. The NRHK1 -fusion 
proteins can also be used for the treatment or prevention of damages caused by (i) aberrant 
modification or mutation of NRHK1, or (ii) aberrant post-translational modification of 
NRHK1 . It is also conceivable that a fusion protein containing a normal or mutated 
NRHK1 polypeptide, or a fragment thereof, can be used to inhibit NRHK1 activity in a 
human subject. 

[0157] Moreover, the NRHK1 -fusion proteins can be used as immunogens to produce 
anti-NRHKl antibodies. They can also be used to purify NRHK1 ligands and to screen for 
molecules capable of inhibiting the interaction between NRHK1 and its substrates. 

[0158] Preferably, the NRHK1 -chimeric or fusion proteins of the invention are produced 
using standard recombinant DNA techniques. Commercially available expression vectors 
which encode a fusion moiety (e.g., a GST polypeptide) can be used. 

[0159] A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a core 
of hydrophobic amino acids which are generally cleaved from the mature protein. Such 
signal peptides contain processing sites that allow cleavage of the signal sequence from the 
mature proteins as they pass through the secretory pathway. The present invention 
encompasses NRHK1 polypeptides having a signal sequence, or the polynucleotide 
sequences encoding the same. 
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,0160] The present invention also pertains to NRHK1 mutants whieh fttnetton as 
antagonists to NRHK1. In one embodiment, antagonists ofNRHKl are used as the ra pen.,c 
agents. For examp.e, a mntan. ofNRHKl rha. forms a non-funetional dtaer with a w.de- 
type NRHK1 (the so-called dominant negative mutant) can decrease the activity ofNRHKl 
and may ameliorate diseases in a subject wherein NRHK1 are abnormally increased tn .eve. 
or activity. Dominant negative NRHK1 mutants can be generated by mutageneses, as 
appreciated by one skilled in the art. 

,01611 NRHK1 mutants which function as either NRHK1 agonists or antagonists can be 
identified by screening combinatorial libraries of mutants. A variegated .ibrary ofNRHKl 
mutants can be produced by, for example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such mat a degenerate set of potential NRHK1 
sequences is expressible as individual polypeptides, or alternatively, as a se, of larger fuston 
proteins containing the se, ofNRHKl sequences therein. There are a variety of methods 
which can be used to produce libraries of potential NRHK1 mutants from a degenerate 
oligonuc.eot.de sequence. A degenerate gene sequence can be chemically synthestzed 
using an automatic DN A synthesize, The synthetic gene can then be ligated mto an 
appropriate expression vector. 

,01621 In one embodiment, a Hbrary of coding sequences can be generated using 
nucleases. For tnstance, double stranded PCR fragments of the NRHK1 coding sequence 
can be treated by a nuclease which produces about one nick per molecule. The double- 
stranded DNAs men are subject to a cycle of denaturing and re-„aturi»g. The newly 
reformed DNAs, which may mc.ude sense/antisense pairs from different nicked products, 
are treated with SI nuclease to remove single stranded portions. Using this method, an 
expression library which encodes N-termina,, C-termina, or internal fragments ofNRHKl 
can be derived. 

10163] In addition, recursive ensemble mutagenesis (REM), a technique which enhances 
the frequency of functional mutants in the libraries, can be used to prepare NRHK1 mutants 
(Delgrave et al, Protein Engineering 6:327-331, 1993). 

[0164] NRHK1 fragments, or variants thereof, can also be generated using synthetic 
means, such as solid-phase synthesis methods. Preferably, the synthesized fragment has 
less than about 100 amino acids, or preferably, less than about 50 amino acids. 
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Antibodies 

,0165] In accordance with another aspect of the present invention, antiboches specie to 
NRHKl or its variants are prepared. An antibody is considered to bind "specifically" to an 
antigen if the binding affinity between the antibody and the antigen is equal to, or greater 
than 10 5 NT 1 . The antibodies can be monoclonal or polyclonal. Preferably, the antibodies 
are monoclonal. More preferably, the antibodies are humanized antibodies. 
[0166] Polyclonal anti-NRHKl antibodies can be prepared by immunizing a suitable 
subject with NRHKl or fragments thereof. The anti-NRHKl antibody titer m the 
hnmunized subject can be monitored over the time using standard techniques, such as 
ELISA. The anti-NRHKl antibody can be isolated from the immunized subject usmg well 
known techniques. 

,0,671 in one embodiment, hybridomas capable of producing anti-NRHKl antibodies are 
prepared Purified NRHKl or its variants, or fragments thereof, are used to tmmuntze a 
vertebrate, such as a mammal. Suitable mammals include mice, rabbits and sheep. 
Preferably, the fragment used for immunization comprises a, leas. 8 amino acid resrdues, 
more preferabfy a. leas. 1 2 amino acid residues, highly preferably a, leas, 16 amino acd 
residues, and most preferably at least 20 amino acid residues. 

,0,68] immunogenic fragments (epitopes) of NRHKl can be identified using well known 
techniques. In general, any fragment of SEQ ID NO:2 can be used to raise anttbodres 
specific to NRHKl . Preferred epitopes are regtons that are located on the surface of 
NRHKl . These regions are usually hydrophilic 

,01691 Splenocytes are rsolated from the immunized vertebrate and fused with an 
immortalized cel. hue (such as a myeloma) to form hybridomas. Ptsferably, .he immortal 
cell fine is derived from me same mammalian species as tire lymphocytes. For example, 
murine hybridomas can be made by ftrsing an immortalized mouse cell l.ne w„„ 
Wmphocy.es isolated from a mouse .ha. is immunized with an tmmunogenic preparation of 
the presen. invention. Preferred immortalized cel. lines include mouse myeloma cell tines 
th a. are sensitive to culture medtutn contenting hypoxanfhme, amrnopterin and thymtdme 
("HAT medium"). Suitable myeloma cell lines include, but are no. limited to, the 
P3-NSl/l-Ag4-l, P3-x63-A g 8.653 or Sp210-Agl4 myeloma lines, all of which are avatiable 
from ATCC. in one embodiment, HAT-sensitive mouse myeloma cells are fused to mouse 
splenocytes using polyethylene glycol ("PEG"). Hybridoma cells Ihus produced are 
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selected against HAT medium, which kills unfused or unproductively fused myeloma cells. 
Hybridoma cells which produce monoclonal anti-NRHKl antibodies are then detected by 
screening the hybridoma culture supernatants. 

[0170] A monoclonal anti-NRHKl antibody can also be prepared by screening a 
recombinant combinatorial immunoglobulin library {e.g., an antibody phase display library). 
Kits for generating and screening phage display libraries are commercially available {e.g., 
the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 

[0171] The anti-NRHKl antibodies of the present invention also include "single-chain 
Fv" or "scFv." The scFv fragments comprise the V H and V L domains of an antibody. 
Generally, the scFv polypeptide further comprises a polypeptide linker between the V H and 
V L domains. The polypeptide linker enables the scFv to form the desired structure for 
antigen binding. Additionally, recombinant anti-NRHKl antibodies, such as chimeric and 
humanized monoclonal antibodies, can be prepared, as appreciated by one of ordinary skill 
in the art. 

[0172] Humanized antibodies are particularly desirable for therapeutic treatment of 
human subjects. Humanized forms of non-human {e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains, or fragments thereof (such as Fv, Fab, Fab 1 , 
F(ab') 2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies are derived 
from human immunoglobulins in which the residues forming the complementary 
determining regions (CDRs) are replaced by the residues from CDRs of a non-human 
antibody, such as a mouse, rat or rabbit antibody having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise 
residues which are found neither in the recipient antibody nor in the imported CDR or 
framework sequences. The humanized antibody can comprise at least one or two variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non- 
human immunoglobulin and all or substantially all of the constant regions are those of a 
human immunoglobulin consensus sequence. The humanized antibody preferably 
comprises at least a portion of an immunoglobulin constant region (Fc) of a human 
immunoglobulin. 
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[0173] Humanized antibodies can be produced using transgenic mice which are incapable 
of expressing endogenous immunoglobulin heavy and light chains but can express human 
heavy and light chains. The transgenic mice are immunized in the normal fashion with a 
selected antigen. Monoclonal antibodies directed against the antigen can be obtained using 
conventional hybridoma technology. The human immunoglobulin transgenes harbored in 
the transgenic mice rearrange during B cell differentiation, and subsequently undergo class 
switching and somatic mutation. Using this technique, therapeutically useful IgG, IgA and 
IgE antibodies can be prepared. 

[0174] In addition, humanized antibodies which recognize a selected epitope can be 
generated using a technique referred to as "guided selection." In this approach a selected 
non-human monoclonal antibody, e.g. , a murine antibody, is used to guide the selection of a 
humanized antibody recognizing the same epitope. 

[0175] In a preferred embodiment, the antibodies to NRHK1 are capable of reducing or 
eliminating the biological function of NRHK1 . Preferably, the antibodies reduce at least 
25% of NRHK1 activity. More preferably, the antibodies reduce at least about 50% of the 
activity. Highly preferably, the antibodies reduce about 95-100% of NRHK1 activity. 
[0176] Anti-NRHKl antibodies can be used to isolate NRHK1 . Suitable methods include 
affinity chromatography and immunoprecipitation. Moreover, anti-NRHKl antibodies can 
be used to evaluate the expression level of NRHK1 . Anti-NRHKl antibodies can also be 
used to monitor NRHK1 level as part of a clinical testing procedure, or to determine the 
efficacy of a given treatment regimen. Detection can be facilitated by coupling the antibody 
to a detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent matenals, 
and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent matenals 
include luciferase, luciferin, and aequorin; and examples of suitable radioactive matenals 
include ,25 I, 13, I, 35 Sor 3 H. 
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[0177] Anti-NRHKl antibodies are also useful for targeting a therapeutic agent/drug to a 
particular cell or tissue. The therapeutic agent/drug may be coupled to an antibody, either 
covalently or non-covalently. For instance, a therapeutic agent can be coupled to an 
antibody via a linker group. A linker group can function as a spacer to separate the 
antibody from the agent so as to avoid interference with antibody's binding capabilities. 
The linker group can also serve to increase the chemical reactivity of a substituent on the 
agent or the antibody, and thus increase the coupling efficiency. A variety of Afunctional 
or polyfunction^ reagents, either homo- or hetero-functional (such as those described in the 
catalog of the Pierce Chemical Co., Rockford, 111.), can be employed as the linker group. 
Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl 
groups or oxidized carbohydrate residues. There are numerous references describing this 
methodology. See e.g., U.S. Pat. No. 4,671,958. 

[0178] Where a therapeutic agent is more potent when free from the antibody, it may be 
desirable to use a linker group which is cleavable during or upon internalization into the 
target cell. A number of different cleavable linker groups have been described. The 
mechanisms for the intracellular release of an agent from these linker groups include 
cleavage by reduction of a disulfide bond (e.g., U.S. Pat. No. 4,489,710), by irradiation of a 
photolabile bond (e.g., U.S. Pat. No. 4,625,014), by hydrolysis of derivatized amino acid 
side chains (e.g., U.S. Pat. No. 4,638,045), by serum complement-mediated hydrolysis (e.g., 
U.S. Pat. No. 4,671,958), or by acid-catalyzed hydrolysis (e.g., U.S. Pat. No. 4,569,789). 
[0179] It may also be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple agents are coupled to one antibody molecule. In another 
embodiment, at least two different types of agents are coupled to one antibody. Regardless 
of the particular embodiment, immunoconjugates coupled with more than one agent can be 
prepared in a variety of ways, as appreciated by one of ordinary skill in the art. 
Vectors. E x pression Vectors and Gene Delive ry Vectors 

[0180] Another aspect of the invention pertains to vectors containing a polynucleotide 
encoding NRHK1 or a portion thereof. One type of vector is a "plasmid," which includes a 
circular double stranded DNA into which additional DNA segments can be introduced. 
Vectors also include expression vectors and gene delivery vectors. 
[0181] The expression vectors of the present invention comprise a polynucleotide 
encoding NRHK1 or a portion thereof. The expression vectors also include one or more 
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regulatory sequences operably linked to the polynucleotide being expressed. These 
regulatory sequences are selected based on the type of host cells. It will be appreciated by 
those skilled in the art that the design of the expression vector depends on such factors as 
the choice of the host cells and the desired expression levels. NRHK1 can be expressed in 
bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), yeast cells 
or mammalian cells. The expression vector can also be transcribed and translated in vitro, 
for example, by using T7 promoter regulatory sequences and T7 polymerase. 
[0182] Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of the recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification of 
the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Suitable cleavage 
enzymes include Factor Xa, thrombin and enterokinase. Examples of fusion expression 
vectors include pGEX (Pharmacia Piscataway, NJ), pMAL (New England Biolabs, Beverly, 
MA) and pRITS (Pharmacia, Piscataway, NJ). Purified fusion proteins can be utilized in 
NRHK1 activity assays, or to generate antibodies specific for NRHK1. 
[0183] Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
and pET 1 Id. Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET 
1 Id vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a co- 
expressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host 
strains BL21(DE3) or HSLE174(DE3) from a resident prophage harboring a T7 gnl gene 
under the transcriptional control of the lacUV 5 promoter. 

[0184] One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in host bacteria that have an impaired capacity to proteolytically cleave the 
recombinant protein. Another strategy is to alter the polynucleotide sequence encoding the 
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protein so that the individual codons for each amino acid are those preferentially utilized in 
E. coli. 

[0185] In another embodiment, the NRHK1 expression vector is a yeast expression 
vector. Examples of yeast expression vectors include pYepSecl, pMFa, pJRY88, pYES2 
(Invitrogen Corporation, San Diego, CA), and picZ (Invitrogen Corp, San Diego, CA). 
[0186] Alternatively, NRHK1 or its variant can be expressed in insect cells using 
baculovirus expression vectors. Suitable baculovirus vectors include the pAc series and the 
pVL series. 

[0187] In yet another embodiment, NRHK1 or its variant is expressed in mammalian cells 
using a mammalian expression vector. Examples of mammalian expression vectors include 
pCDM8 and pMT2PC. When used in mammalian cells, the expression vector's control 
functions are often provided by viral regulatory elements. For example, commonly used 
promoters are derived from polyoma, adenovirus 2, cytomegalovirus and Simian Virus 40. 
[0188] In another embodiment, the mammalian expression vector contains tissue-specific 
regulatory elements. Examples of suitable tissue-specific promoters include the liver- 
specific albumin promoter, lymphoid-specific promoters, promoters of T cell receptors and 
immunoglobulins, neuron-specific promoters {e.g., the neurofilament promoter), pancreas- 
specific promoters, and mammary gland-specific promoters {e.g., milk whey promoter). 
Developmentally-regulated promoters are also contemplated, which include, for example, 
the a-fetoprotein promoter. 

[0189] The present invention also provides a recombinant expression vector comprising a 
polynucleotide which encodes NRHK1 but is cloned into the expression vector in an 
antisense orientation. Regulatory sequences that are operatively linked to the antisense- 
oriented polynucleotide can be chosen to direct continuous expression of the antisense RNA 
molecule in a variety of cell types. Suitable regulatory sequences include viral promoters 
and/or enhancers. Regulatory sequences can also be chosen to direct constitutive, tissue 
specific or cell type specific expression of the antisense RNA. The antisense expression 
vector can be in the form of a recombinant plasmid, phagemid, or attenuated virus in which 
antisense polynucleotides are produced under the control of a highly efficient regulatory 
region. 
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[0190] The present invention further provides gene delivery vehicles for delivering 
polynucleotides to mammals. A polynucleotide sequence of the invention can be 
administered either locally or systemically via a gene delivery vehicle. Expression of the 
polynucleotide can be induced using endogenous mammalian or heterologous promoters. 
Expression of the polynucleotide in vivo can be either constituted or regulated. The gene 
delivery vehicles preferably are viral vectors, including retroviral, lentiviral, adenoviral, 
adeno-associated viral (AAV), herpes viral, or alphavirus vectors. The viral vectors can 
also be astrovirus, coronavirus, orthomyxovirus, papovavirus, paramyxovirus, parvovirus, 
picomavirus, poxvirus, or togavirus vectors. 

[0191] Delivery of gene therapy constructs is not limited to the above mentioned viral 
vectors. Other delivery methods can also be employed. These methods include nucleic acid 
expression vectors, polycationic condensed DNA linked or unlinked to killed adenovirus, 
ligand linked DNA, liposome-DNA conjugates, gene guns, ionizing radiation, nucleic 
charge neutralization, or fusion with cell membranes. Naked DNA can also be employed. 
Uptake efficiency of the naked DNA may be improved using biodegradable latex beads. 
This method can be further improved by treating the beads to increase their hydrophobicity. 

Regulatable Expression Systems 

[0192] Another aspect of the present invention pertains to the use of regulatable 
expression systems to express desirable polynucleotides or polypeptides in cells. Systems 
suitable for this invention are briefly described below: 

[0193] Tet-on/off system. The Tet-system is based on two regulatory elements derived 
from the tetracycline-resi stance operon of the E. coli TnlO transposon: the tet repressor 
protein (TetR) and the Tet operator DNA sequence (tetO) to which TetR binds (Gossen et ^ 
al, Science 268: 1766-1769, 1995). The system consists of two components, a "regulator" 
and a "reporter" plasmid. The "regulator" plasmid encodes a hybrid protein containing a 
mutated Tet repressor (rtetR) fused to the VP16 activation domain of herpes simplex virus. 
The "reporter" plasmid contains a tet-responsive element (TRE), which controls the 
"reporter" gene of choice. The rtetR-VP16 fusion protein can only bind to the TRE, 
therefore activating the transcription of the "reporter" gene in the presence of tetracycline. 
The system has been incorporated into a number of viral vectors including retrovirus, 
adenovirus and AAV. 
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[0194] Ecdysone system. The Ecdysone system is based on the molting induction system 
found in Drosophila, but modified for inducible expression in mammalian cells. The 
system uses an analog of the Drosophila steroid hormone ecdysone, muristerone A, to 
activate expression of the gene of interest via a heterodimeric nuclear receptor. Expression 
levels have been reported to exceed 200-fold over basal levels with no effect on mammalian 
cell physiology (No et al, Proc. Natl. Acad. Sci. USA 93: 3346-3351, 1996). 
[0195] Progesterone-system. The progesterone receptor is normally stimulated to bind to 
a specific DNA sequence and to activate transcription through an interaction with its 
hormone ligand. Conversely, the progesterone antagonist mifepristone (RU486) is able to 
block hormone-induced nuclear transport and subsequent DNA binding. A mutant form of 
the progesterone receptor that can be stimulated to bind through an interaction with RU486 
has been generated. To generate a specific, regulable transcription factor, the RU486- 
binding domain of the progesterone receptor has been fused to the DNA-binding domain of 
the yeast transcription factor GAL4 and the transactivation domain of the HSV protein 
VP16. The chimeric factor is inactive in the absence of RU486. The addition of hormone, 
however, induces a conformational change in the chimeric protein, and this change allows 
binding to a GAL4-binding site and the activation of transcription from promoters 
containing the GAL4-binding site (Wang et al, Nat. Biotech 15: 239-243, 1997). 
[0196] Rapamycin-system. Immunosuppressive agents, such as FK506 and rapamycin, act 
by binding to specific cellular proteins and facilitating their dimerization. For example, the 
binding of rapamycin to FK506-binding protein (FKBP) results in its heterodimerization 
with another rapamycin binding protein FRAP, which can be reversed by removal of the 
drug. The ability to bring two proteins together by addition of a drug potentiates the 
regulation of a number of biological processes, including transcription. A chimeric DNA- 
binding domain has been fused to the FKBP, which enables binding of the fusion protein to 
a specific DNA-binding sequence. A transcriptional activation domain also has been fused 
to FRAP. When these two fusion proteins are co-expressed in the same cell, a fully 
functional transcription factor can be formed by heterodimerization mediated by addition of 
rapamycin. The dimerized chimeric transcription factor can then bind to a synthetic 
promoter sequence containing copies of the synthetic DNA-binding sequence. This system 
has been successfully integrated into adenoviral and AAV vectors. Long term regulable 
gene expression has been achieved in both mice and baboons (Ye et al, Science 283: 88-91, 
1999). 
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Detection Methods 

[0197] In patients with disorders related to the aberrant expression ofNRHKl . The 
expression level ofNRHKl can be used as an indicator for detecting the presence of 
NRHK1 -related diseases in humans. Detection and measurement of the relative amount of 
the NRHK1 gene product can be carried out using various methods known in the art. 
[0198] Typical methodologies for detecting the transcription level of a gene include 
extracting RNA from a cell or tissue sample, hybridizing a labeled probe to the extracted 
RNA or derivative thereof (such as cDNA or cRN A), and detecting the probe. Suitable 
methods include Northern Blot and quantitative RCR or RT-PCR. In situ hybridization can 
also be used to detect the transcription level of the NRHK1 gene in human tissues. 
[0199] Typical methodologies for detecting a polypeptide include extracting proteins 
from a cell or tissue sample, binding an antibody to the target polypeptide and detecting the 
antibody. Suitable methods include enzyme linked immunosorbent assays (ELISAs), 
Western blots, immunoprecipitations, and immunofluorescence. The antibody can be 
polyclonal, or preferably, monoclonal. The antibody can be an intact antibody, or a 
fragment thereof (e.g. Fab or F(ab') 2 ). The antibody can be labeled with a radioisotope, a 
fluorescent compound, an enzyme, an enzyme co-factor, or a detectable ligand. The term 
"labeled," with regard to a probe or antibody, is intended to encompass direct labeling such 
as through covalent coupling, as well as indirect labeling such as being mediated by another 
reagent which is directly labeled. Examples of indirect labeling include labeling a primary 
antibody using a fluorescently labeled secondary antibody, or attaching a DNA probe with a 
biotin which can be detected, for example, by a fluorescence-labeled streptavidin. 
[0200] Preferably, the binding affinity of the antibody to NRHK1 is at least 105 M-l . 
More preferably, the binding affinity is at least 1 06 M-l . Other methods such as 
electrophoresis, chromatography or direct sequencing can also be used to detect the amount 
of a polypeptide in a biological sample. Anti-NRHKl antibodies can also be directly 
introduced into a subject. The antibody can be labeled with a radioactive marker whose 
presence and location in the subject can be detected using standard imaging techniques. 
[0201] In one embodiment, the genomic copies of the NRHK1 gene in the genome of a 
human subject may indicate the presence or predisposition of a disease. Detection of the 
presence or number of copies of the NRHK1 gene in the genome can be performed using 
methods known in the art. For instance, it can be assessed using Southern Blot. The probes 
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for Southern Blot can be labeled with a radioisotope, a fluorescent compound, an enzyme, 
or an enzyme co-factor. 

[0202] In the field of diagnostic assays, the above-described detection methods can be 
used to determine the severity of NRHK1 -related diseases. A biological sample is isolated 
from a test subject, and the presence, quantity and/or activity of NRHK1 in the sample 
relative to a disease free or control sample is evaluated. The expression level of NRHK1 in 
the biological sample can indicate the presence or severity of NRHK1 -related diseases in 
the test subject. The term "biological sample" is intended to include tissues, cells or 
biological fluids isolated from the subject. A preferred biological sample is a serum sample 
isolated from the subject using conventional means. 
Screening Methods 

[0203] The present invention also provides methods for identifying NRHK1 
modulators. Suitable modulators include compounds or agents comprising therapeutic 
moieties, such as peptides, peptidomimetics, peptoids, polynucleotides, small molecules or 
other drugs. These moieties can either bind to NRHK1 , or have a modulatory (e.g., 
stimulatory or inhibitory) effect on the activity of NRHK1. In one embodiment, the 
moieties have a modulatory effect on the interactions of NRHK1 with one or more of its 
natural substrates. These moieties can also exert a modulatory effect on the expression of 
NRHK1 . The screen assays of the present invention comprise detecting the interactions 
between NRHK1 and test components. 

[0204] The test compounds of the present invention can be either small molecules or 
bioactive agents. In a preferred embodiment, the test compound is a small organic or 
inorganic molecule. In another preferred embodiment, the test compounds are polypeptides, 
oligopeptides, polysaccharides, nucleotides or polynucleotides. 
[0205] In accordance with one aspect of this invention, methods for screening for 
compounds that inhibit the biological activities of NRHK1 are provided. Pharmaceutical 
compositions comprising these compounds can subsequently be prepared. The screening 
method comprises (1) contacting a sample with a compound, and (2) comparing expression 
profile or biological activity of NRHK1 in the sample to determine whether the compound 
substantially decreases the expression level or activities of NRHK1. The screening method 
can be carried out either in vivo or in vitro. 
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[0206] The present invention further includes a method for screening for compounds 
capable of modulating the binding between NRHK1 and a binding partner. As used herein, 
the term "binding partner" refers to a bioactive agent which serves as either a substrate for 
NRHK1 , or a ligand having a binding affinity to NRHK1 . The bioactive agent may be 
selected from a variety of naturally-occurring or synthetic compounds, proteins, peptides, 
polysaccharides, nucleotides or polynucleotides. 

[0207] Inhibitors of the expression, activity or binding ability of NRHK1 may be used 
as therapeutic compositions. These inhibitors can be formulated in suitable pharmaceutical 
compositions, as described herein below. 

[0208] The present invention also provides methods for conducting high-throughput 
screening for compounds capable of inhibiting activity or expression of NRHK1. In one 
embodiment, the high-throughput screening method involves contacting test compounds 
with NRHK1 , and then detecting the effect of the test compounds on NRHK1 . Functional 
assays, such as cytosensor microphysiometer-based assays, calcium flux assays (e.g., 
FLIPR®, Molecular Devices Corp, Sunnyvale, CA), or the TUNEL assay, can be employed 
to measure NRHK1 cellular activity. Fluorescence-based techniques can be used for high- 
throughput and ultra high-throughput screening. They include, but are not limited to, 
BRET® and FRET® (both by Packard Instrument Co., Meriden, CT). 
[0209] In a preferred embodiment, the high-throughput screening assay uses label-free 
plasmon resonance technology as provided by BIACORE® systems (Biacore International 
AB, Uppsala, Sweden). Plasmon free resonance occurs when surface plasmon waves are 
excited at a metal/liquid interface. By reflecting directed light from the surface as a result 
of contact with a sample, the surface plasmon resonance causes a change in the refractive 
index at the surface layer. The refractive index change for a given change of mass 
concentration at the surface layer is similar for many bioactive agents (including proteins, 
peptides, lipids and polynucleotides), and since the BIACORE® sensor surface can be 
functionalized to bind a variety of these bioactive agents, detection of a wide selection of 
test compounds can thus be accomplished. 
Monitorin g. Efficacy Of A Drue D uring Clinical Trials 

[0210] Using the NRHK1 detection methods of this invention, the efficacy of a 
therapeutic agent for NRHK1 -related diseases can be monitored during clinical trials. The 
therapeutic agent may be a drug, small molecule, agonist, antagonist, peptidomimetic, 
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protein, peptide, or polynucleotide. The changes in the expression or activity of the 
NRHK1 gene in response to the treatment of the agent can be used to evaluate the 
therapeutic effect of the agent on patients with NRHKl-related diseases. In addition, the 
expression or activity of NRHK1 in response to the agent can be measured at various points 
during the clinical trial. 

[0211] In a preferred embodiment, the method for monitoring the effectiveness of the 
therapeutic agent includes the steps of (i) obtaining a pre-administration sample from a 
subject; (ii) detecting the level of expression or activity of NRHK1 in the pre-administration 
sample; (iii) obtaining one or more post-administration samples from the subject; (iv) 
detecting the level of expression or activity of NRHK1 in the post-administration samples; 
(v) comparing the level of expression or activity of NRHK1 in the pre-administration 
sample to the level of expression or activity of NRHK1 in the post administration samples. 
The dose or frequency of the administration of the agent may be adjusted based on the 
effectiveness of the agent in a particular patient. Therefore, NRHK1 expression or activity 
can be used as an indicator of the effectiveness of a therapeutic agent for NRHKl-related 
diseases, even if the agent does not produce an observable phenotypic response. 

Pro gnostic Assays 

[0212] The detection methods described herein can be used to identify subjects having 
or at risk of developing NRHKl-related diseases. In addition, the detection methods can be 
used to determine whether an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, polynucleotide, small molecule, or other drug candidate) can be administered to a 
subject for effectively treating or preventing NRHK1 -related diseases. 
[0213] NRHK1 expression profiles at different progression stages of NRHK1 -related 
diseases can be established. In addition, NRHK1 expression profiles in different patients 
who have different responses to a drug treatment are determined. A pattern may emerge 
such that a particular expression profile may be correlated to an increased likelihood of a 
poor prognosis. Therefore, the prognostic assay of the present invention may be used to 
determine whether a subject undergoing a treatment for a NRHKl-related disease has a poor 
outlook for long term survival or disease progression. Preferably, prognosis is performed 
shortly after diagnosis, such as within a few days after diagnosis. The result of prognosis 
can then be used to devise individualized treatment program, thereby enhancing the 
effectiveness of the treatment as well as the likelihood of long-term survival and well being. 
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[0214] The method of the invention can also be used to detect genetic alterations in the 
NRHK1 gene, thereby determining if a subject with the altered gene is at risk for damages 
characterized by aberrant regulation in NRHK1 activity or expression. In a preferred 
embodiment, the method includes detecting the presence or absence of a genetic alteration 
that affects the integrity of the NRHK1 gene, or detecting the aberrant expression of the 
NRHK1 gene. The genetic alteration can be detected by ascertaining the existence of at 
least one of the following: 1) deletion of one or more nucleotides from the NRHK1 gene; 2) 
addition of one or more nucleotides to the NRHK1 gene; 3) substitution of one or more 
nucleotides of the NRHK1 gene, 4) a chromosomal rearrangement in the NRHK1 gene; 5) 
alteration in the level of a messenger RNA transcript of the NRHK1 gene, 6) aberrant 
modification of the NRHK1 gene, 7) the presence of a non-wild type splicing pattern of a 
messenger RNA transcript of the NRHK1 gene, 8) non-wild type level NRHK1, 9) allelic 
loss of an NRHK1 gene, and 10) inappropriate post-translational modification of NRHK1. 
[0215] In one embodiment, detection of the alteration involves the use of a 
probe/primer in a polymerase chain reaction (such as anchor PCR or RACE PCR) or 
alternatively, in a ligation chain reaction (LCR). LCR can be particularly useful for 
detecting point mutations in the NRHK1 gene. This method includes the steps of collecting 
a sample from a subject, isolating polynucleotides (e.g., genomic DNA, mRNA, or both) 
from the sample, contacting the polynucleotide with one or more primers which specifically 
hybridize to the NRHK1 gene or gene product, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing its 
length to a control. It is understood that PCR and/or LCR can be used as a preliminary 
amplification step in conjunction with any other techniques described herein. 
[0216] Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874-1878, 1990), transcriptional 
amplification system (Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1 173-1 177, 1989), and Q- 
Beta Replicase (Lizardi et al. Bio-Technology 6: 1 1 97, 1 988). 

[0217] In another embodiment, mutations in the NRHK1 gene can be identified using 
restriction enzymes. Differences in restriction enzyme digestion patterns indicates 
mutation(s) in the NRHK1 gene or its transcripts. Moreover, sequence specific ribozymes 
can be used to detect the presence of specific mutations. See, for example, U.S. Patent No. 
5,498,531. 
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[0218] In yet another embodiment, genetic mutations in the NRHK1 gene can be 
identified using high density arrays which contain a large number of oligonucleotides 
probes. For example, genetic mutations in the NRHK1 gene can be identified in two 
dimensional arrays. In this example, a first hybridization array of probes is used to scan 
through long stretches of DNA in a sample and a control in order to identify base changes 
between the two sequences. This step allows the identification of point mutations. This 
step is followed by a second hybridization array that allows the characterization of specific 
mutations by using smaller and specialized probe arrays which are complementary to all 
variants or mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 
[0219] In still another embodiment, any sequencing reactions known in the art can be 
used to directly sequence the NRHK1 gene in order to detect mutations. It is contemplated 
that any automated sequencing procedures can be utilized, including sequencing by mass 
spectrometry. 

[0220] In one embodiment, protection from cleavage agents is used to detect 
mismatched bases in RNA/RNA or RNA/DNA heteroduplexes. In general, the "mismatch 
cleavage" technique involves forming heteroduplexes by hybridizing an RNA or DNA 
(labeled) containing the wild-type NRHK1 gene sequence to a potentially mutant RNA or 
DNA obtained from a tissue sample. The double-stranded duplexes are treated with an 
agent which cleaves single-stranded regions of the duplex. The agent may be RNase (for 
RNA/DNA duplexes), or SI nuclease (for DNA/DNA hybrids). In one case, either 
DNA/DNA or RNA/DNA duplexes are treated with piperidine and hydroxylamine, or 
piperidine and osmium tetroxide, in order to digest mismatched regions. After the 
digestion, the resulting material is separated by size on a denaturing polyacrylamide gel 
from which the site(s) of mutation may be determined. 

[0221] In a preferred embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA. Examples of 
these proteins include "DNA mismatch repair" enzymes. For instance, the mutY enzyme of 
E. coli cleaves A at G/A mismatches, and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches. In one case, cDNAs are prepared from mRNAs isolated from 
test cells. The cDNAs are then hybridized to a probe derived from the NRHK1 gene. The 
duplex thus formed is treated with a DNA mismatch repair enzyme, and the cleavage 
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products, if any, can be detected from electrophoresis protocols or the like. See, for 
example, U.S. Patent No. 5,459,039. 

[0222] In another embodiment, alterations in electrophoretic mobility are used to 
identify mutations in the NRHK1 gene. Differences in electrophoretic mobility between 
mutant and wild type polynucleotides can be detected using single strand conformation 
polymorphism (SSCP). The resulting alteration in electrophoretic mobility enables the 
detection of a single base change. The DNA fragments can be labeled or detected with 
probes. In one case, the sensitivity of the assay is enhanced by using RNA, in which the 
secondary structure is more sensitive to a change in sequence. In a preferred embodiment, 
the assay utilizes heteroduplex analysis to separate double stranded heteroduplex molecules 
on the basis of changes in electrophoretic mobility (Keen et aL, Trends Genet 7:5, 1991). 
[0223] In yet another embodiment, the movement of mutant or wild-type fragments is 
evaluated using denaturing gradient gel electrophoresis (DGGE). For this purpose, DNA 
fragments can be modified to insure that they do not completely denature. For instance, a 
GC clamp of approximately 40 GC-rich base pairs can be added to the DNA fragment using 
PCR. In a further embodiment, a temperature gradient is used in place of a denaturing 
gradient (Rosenbaum and Reissner, Biophys Chem 265:12753, 1987). 
[0224] Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. In one embodiment, oligonucleotide primers for specific amplification 
carry the mutation of interest in the center of the molecule (so that amplification depends on 
differential hybridization) or at the extreme 3'end of one primer where, under appropriate 
conditions, mismatch can prevent or reduce polymerase extension. See, for example, Saiki 
et al., Proc. Natl. Acad. Sci USA 86:6230, 1989. In addition, it may be desirable to 
introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection. 

[0225] The methods described herein can be performed using prepackaged diagnostic 
kits which comprise at least one polynucleotide probe or one antibody of the present 
invention. These kits can be used in clinical settings to diagnose subjects exhibiting 
symptoms or family history of a NRHK1 -related disease. Any cell type or tissue in which 
NRHK1 is expressed can be used for prognostic or diagnostic purposes. 
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Prophylactic Methods 

[0226] This invention also provides methods for preventing diseases associated with 
aberrant NRHK1 expression or activity. The methods comprise administering to a target 
subject an agent which modulates NRHK1 expression or activity. 
[0227] Subjects at risk of diseases which are caused by or attributed to aberrant 
NRHK1 expression or activity can be identified using the diagnostic or prognostic assays 
described herein. A prophylactic agent can be administered prior to the manifestation of 
NRHK1 -related disease symptoms in order to prevent or delay NRHKl-related diseases. 
Suitable prophylactic agents include mutant NRHK1 proteins, NRHK1 antagonist agents, or 
NRHK1 antisense polynucleotides. 

[0228] The prophylactic methods of this invention can be specifically tailored or 
modified based on knowledge obtained from the study of pharmacogenomics. 
Pharmacogenomics includes the application of genomics technologies, such as gene 
sequencing, statistical genetics, and gene expression analysis, to drugs which are either in 
clinical development or on the market. IPharmacogenomics can be used to determine a 
subject's response to a drug (e.g., a subject's "drug response phenotype" or "drug response 
genotype"). Thus, another aspect of this invention is to provide methods for tailoring an 
individual's prophylactic or therapeutic treatment using NRHK1 modulators according to 
the individual's drug response genotype. Pharmacogenomics allows a clinician or physician 
to target prophylactic or therapeutic treatments to subjects who will most benefit from the 
treatment and to avoid treatment of subjects who will experience toxic drug-related side 
effects. 

[0229] One pharmacogenomics approach to identify genes that predict drug response, 
known as "a genome-wide association," relies primarily on a high-resolution map of the 
human genome consisting of already known gene-related sites (e.g., a "bi-allelic" gene 
marker map which consists of 60,000-100,000 polymorphic or variable sites on the human 
genome, each of which has two variants). Such a high-resolution genetic map can be 
compared to a map of the genome of each of a statistically substantial number of subjects 
taking part in a Phase II/III drug trial in order to identify genes associated with a particular 
observed drug response or side effect. Alternatively, such a high resolution map can be 
generated from a combination of some ten-million known single nucleotide polymorphisms 
(SNPs) in the human genome. A "SNP" is a common alteration that occurs in a single 
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nucleotide base in a stretch of DNA. For example, a SNP may occur once per every 1000 
bases of DNA. A SNP may be involved in a disease process. However, the vast majority of 
SNPs may be not related to diseases. Given a genetic map based on the occurrence of 
SNPs, individuals can be grouped into genetic categories depending on a particular pattern 
of SNPs in their individual genome. In such a manner, treatment regimens can be tailored 
to groups of genetically similar individuals, taking into account traits that may be common 
among such genetically similar individuals. Thus, mapping of the NRHK1 gene to SNP 
maps of patients with NRHKl-related diseases may facilitate the identification of drug- 
response-prediction genes. 

[0230] Alternatively, the "candidate gene approach" can be utilized to identify genes 
that predict drug response. According to this method, if a gene that encodes a drug target is 
known, all common variants of that gene can be easily identified in the population. It then 
can be determined if a particular drug response is associated with one version of the gene 
versus another. 

[0231] The activity of drug metabolizing enzymes is a major determinant of both the 
intensity and duration of drug action. The discovery of genetic polymorphisms of drug 
metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes 
CYP2D6 an d CYPZC19) has provided an explanation as to why some subjects do not 
obtain the expected drug effects or show exaggerated drug response and serious toxicity 
after taking the standard and safe dose of a drug. These polymorphisms are expressed in 
two phenotypes in the population, extensive metabolizer and poor metabolizer. The 
prevalence of poor metabolizer phenotypes is different among different populations. For 
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have 
been identified in poor metabolizers, which all lead to the absence of functional CYP2D6. 
Poor metabolizers of CYP2D6 and CYP2C19 quite frequently experience exaggerated drug 
response and side effects when they receive standard doses. If a metabolite is the active 
therapeutic moiety, poor metabolizers show no therapeutic response. The other extreme are 
the so called ultra-rapid metabolizers who do not respond to standard doses. Recently, the 
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 

[0232] In one embodiment, the "gene expression profiling" method can be utilized to 
identify genes that predict drug response. In this regard, the gene expression profile of an 
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animal dosed with a drug can give an indication of whether the gene pathways related to 
toxicity have been turned on. 

10233] Information generated from the above pharmacogenomics approaches can be 
used to determine the appropriate dosage or treatment regimen suitable for a particular 
individual. This knowledge can avoid adverse reactions or therapeutic failure, and therefore 
enhance therapeutic or prophylactic efficiency when treating a subject with an NRHKl 
modulator. 

Therapeutic Methods 

[0234] As described above, the present invention includes therapeutic methods for 
treating a subject at risk for, susceptible to, or diagnosed with NRHKl-related diseases. 
The therapeutic methods can be individually tailored based on the subject's drug response 
genotype. Typically, the therapeutic methods comprise modulating the expression or 
activity of NRHKl in the subject. In one embodiment, the method comprises contacting a 
plurality of cells in the subject with an agent that inhibits the expression or activity of 
NRHKl . Suitable agents include polynucleotides (e.g., an antisense oligonucleotides of 
NRHKl), polypeptides (e.g., a dominant negative mutant of NRHKl), or polysaccharides, 
naturally-occurring target molecules of NRHKl protein (e.g., an NRHKl protein substrate 
or receptor), anti-NRHKl antibodies, NRHKl antagonists, or other small organic and 
inorganic molecule. They may also include vectors comprising polynucleotides encoding 
NRHKl inhibitors or antisense sequences. Moreover, the agents can be anti-NRHKl 
antibodies conjugated with therapeutic moieties. Suitable agents can be identified using the 
screening assays of the present invention. 
Pharmaceutical C ompositions 

[0235] The present invention is further directed to pharmaceutical compositions 
comprising an NRHKl modulator and a pharmaceutical^ acceptable carrier. As used 
herein, a "pharmaceutical^ acceptable carrier" is intended to include any and all solvents, 
solubilizers, fillers, stabilizers, binders, absorbents, bases, buffering agents, lubricants, 
controlled release vehicles, diluents, emulsifying agents, humectants, lubricants, disperse 
media, coatings, antibacterial or antifungal agents, isotonic and absorption delaying agents, 
and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutical^ active substances is well-known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof in 
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the compositions is contemplated. Supplementary agents can also be incorporated into the 
compositions. 

[02361 A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following components: 
a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, 

glycerine; propylene glycol or other synthetic solvents; antibacterial agents such as benzyl 

alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfate; 

chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 

phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. 

pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. 

The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple 

dose vials made of glass or plastic. 

[0237] Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include 
physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or 
phosphate buffered saline (PBS). In all cases, the injectable composition should be sterile 
and should be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures 
thereof. The proper fluidity can be maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and by the 
use of surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 
acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
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including in the composition an agent which delays absorption, for example, aluminum 
monostearate and gelatin. 

[0238] Sterile injectable solutions can be prepared by incorporating the active 
modulator (e.g., an anti-NRHKl antibody, an NRHK1 activity inhibitor, or a gene therapy 
vector expressing antisense nucleotide to NRHK1) in the required amount in an appropriate 
solvent, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle which contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active, 
ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

[0239] Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible 
binding agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 
or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
Primogel, or corn starch; a lubricant such as magnesium stearate or Stertes; a glidant such as 
colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring 
agent such as peppermint, methyl salicylate, or orange flavoring. 

[0240] For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

[0241] Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid 
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derivatives. Transmucosal administration can be accomplished through the use of nasal 
sprays or suppositories. For transdermal administration, the bioactive compounds are 
formulated into ointments, salves, gels, or creams as generally known in the art. 
[0242] The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

[0243] In one embodiment, the therapeutic moieties, which may contain a bioactive 
compound, are prepared with carriers that will protect the compound against rapid 
elimination from the body, such as a controlled release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, 
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, 
and polylactic acid. Methods for preparation of such formulations will be apparent to those 
skilled in the art. The materials can also be obtained commercially from e.g. Alza 
Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as 
pharmaceutical^ acceptable carriers. These can be prepared according to methods known 
to those skilled in the art, for example, as described in U.S. Patent No. 4,522,8 1 1 . 
[0244] It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as 
used herein includes physically discrete units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

[0245] Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. 
Compounds which exhibit large therapeutic indices are preferred. While compounds that 
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exhibit toxic side effects may be used, care should be taken to design a delivery system that 
targets such compounds to the site of affected tissue in order to minimize potential damage 
to uninfected cells and, thereby, reduce side effects. 

[0246] The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little or 
no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the method of 
the invention, the therapeutically effective dose can be estimated initially from cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC50 (i.e., the concentration of the test compound 
which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 
information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 
[0247] The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

Kits 

[0248] The invention also encompasses kits for detecting the presence of an NRHK1 
gene product in a biological sample. An example kit comprises reagents for assessing 
expression of NRHK1 at mRNA or protein level. Preferably, the reagents include an 
antibody or fragment thereof, wherein the antibody or fragment specifically binds to 
NRHK1 . Optionally, the kits may comprise a polynucleotide probe capable of specifically 
binding to a transcript of the NRHK1 gene. The kit may also contain means for determining 
the amount of NRHK1 protein or mRNA in the test sample, and/or means for comparing the 
amount of NRHK1 protein or mRNA in the test sample to a control or standard. The 
compound or agent can be packaged in a suitable container. 

[0249] The invention further provides kits for assessing the suitability of each of a 
plurality of compounds for inhibiting NRHK1 -related diseases in cells or human subjects. 
Such kits include a plurality of compounds to be tested, and a reagent (such as an antibody 
specific to NRHK1 proteins, or a polynucleotide probe or primer capable of hybridizing to 
the NRHK1 gene) for assessing expression of NRHK1 . 
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[0250] It should be understood that the above-described embodiments are given by way 
of illustration, not limitation. Various changes and modifications within the scope of the 
present invention will become apparent to those skilled in the art from the present 
description. 
Host Cells 

[0251] Another aspect of the invention pertains to host cells into which a 
polynucleotide molecule of the invention is introduced, e.g., an NRHK1 gene or homolog 
thereof, within an expression vector, a gene delivery vector, or a polynucleotide molecule of 
the invention containing sequences which allow it to homologously recombine into a 
specific site of the host cell's genome. The terms "host cell" and "recombinant host cell- 
are used interchangeably herein. It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential progeny of such a cell. Because 
certain modifications may occur in succeeding generations due to either mutation or 
environmental influences, such progeny may not, in fact, be identical to the parent cell, but 
are still included within the scope of the term as used herein. 

[0252] A host cell can be any prokaryotic or eukaryotic cell. For example, an NRHK1 
gene can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian 
cells (such as Chinese hamster ovary cells (CHO), COS cells, Fischer 344 rat cells, 
HLA-B27 rat cells, HeLa cells, A549 cells, or 293 cells). Other suitable host cells are 
known to those skilled in the art. 

[0253] Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign polynucleotide (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DAKD-dextran-mediated 
transfection, lipofection, or electoporation. 

[0254] For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable flag (e.g., resistance to antibiotics) is generally 
introduced into the host cells along with the gene of interest. Preferred selectable flags 
include those which confer resistance to drugs, such as G418, hygromycin and 
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methotrexate. A Polynucleotide encoding a selectable flag can be introduced into a host cell 
by the same vector as that encoding NRHK1 or can be introduced by a separate vector. 
Cells stably transfected with the introduced polynucleotide can be identified by drug 
selection (e.g., cells that have incorporated the selectable flag gene will survive, while the 
other cells die). 

[0255] A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) NRHK1 . Accordingly, the invention further 
provides methods for producing NRHK1 using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector containing an NRHK1 gene has been introduced) in a 
suitable medium such that NRHK1 is produced. In another embodiment, the method further 
comprises isolating NRHK1 from the medium or the host cell. 
Transgenic and Knockout Animals 

[0256] The host cells of the invention can also be used to produce non-human 
transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NRHK1 -coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 
which exogenous sequences encoding NRHK1 have been introduced into their genome or 
homologous recombinant animals in which endogenous sequences encoding NRHK1 have 
been altered. Such animals are useful for studying the function and/or activity of NRHK1 
and for identifying and/or evaluating modulators of NRHK1 activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent 
such as a rat or mouse, in which one or more of the cells of the animal includes a transgene. 
Other examples of transgenic animals include non-human primates, sheep, dogs, cows, 
goats, chickens, amphibians, and the like. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an encoded 
gene product in one or more cell types or tissues of the transgenic animal. As used herein, a 
"homologous recombinant animal" or "knockout animal" is a non-human animal, preferably 
a mammal, more preferably a mouse, in which an endogenous NRHK1 gene has been 
altered by homologous recombination between the endogenous gene and an exogenous 
DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 
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[0257] A transgenic animal of the invention can be created by introducing an NRHK1 - 
encoding polynucleotide into the mate pronuclei of a fertilized oocyte, e.g., by 
microinjection or retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. Intronic sequences and polyadenylation signals can 
also be included in the transgene to increase the efficiency of expression of the transgene. 
A tissue-specific regulatory sequence(s) can be operably linked to a transgene to direct 
expression of NRHK1 to particular cells. Methods for generating transgenic animals via 
embryo manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of a 
transgene of the invention in its genome and/or expression of mRNA corresponding to a 
gene of the invention in tissues or cells of the animals. A transgenic founder animal can 
then be used to breed additional animals carrying the transgene. Moreover, transgenic 
animals carrying a transgene encoding NRHK1 can further be bred to other transgenic 
animals carrying other transgenes. 

[0258] To create a homologous recombinant animal (knockout animal), a vector is 
prepared which contains at least a portion of a gene of the invention into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the 
gene. The gene can be a human gene, but more preferably, is a non-human homolog of a 
human gene of the invention (e.g., a homolog of the NRHK1 gene). For example, a mouse 
gene can be used to construct a homologous recombination polynucleotide molecule, e.g., a 
vector, suitable for altering an endogenous gene of the invention in the mouse genome. In a 
preferred embodiment, the homologous recombination polynucleotide molecule is designed 
such that, upon homologous recombination, the endogenous gene of the invention is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
"knockout" vector). Alternatively, the homologous recombination polynucleotide molecule 
can be designed such that, upon homologous recombination, the endogenous gene is 
mutated or otherwise altered but still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the expression of the endogenous NRHK1 
gene). In the homologous recombination polynucleotide molecule, the altered portion of the 
gene of the invention is flanked at its 5' and 3' ends by additional polynucleotide sequence 
of the gene of the invention to allow for homologous recombination to occur between the 
exogenous gene carried by the homologous recombination polynucleotide molecule and an 
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endogenous gene in a cell, e.g., an embryonic stem cell. The additional flanking 
polynucleotide sequence is of sufficient length for successful homologous recombination 
with the endogenous gene. 

[0259] Typically, several kilobases of flanking DN A (both at the 5 ' and 3 ' ends) are 
included in the homologous recombination polynucleotide molecule. The homologous 
recombination polynucleotide molecule is introduced into embryonic stem cells by 
electroporation. The cells in which the introduced gene has homologously recombined with 
the endogenous gene are selected. The selected cells can then be injected into a blastocyst 
of an animal (e.g., a mouse) to form aggregation chimeras. A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster animal and the embryo brought to 
term. Progeny harboring the homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal contain the homologously recombined 
DNA by germline transmission of the homologously recombined DNA. Methods for 
constructing homologous recombination polynucleotide molecules, e.g., vectors, or 
homologous recombinant animals are well known in the art. 

[0260] In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. Another 
example of a recombinase system is the FLP recombinase system of Saccharomyces 
cerevisiae (see e.g., O'Gorman et al., Science 251:1351-1355, 1991). If a cre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g., by 
mating two transgenic animals, one containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 

[0261] Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al., Nature 385:810-813, 
1997, and PCT International Publication Nos. WO97/07668 and WO97/07669. In brief, a 
cell, e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter GO phase. The quiescent cell can then be fused, e.g., through the use 
of electrical pulses, to an enucleated oocyte from an animal of the same species from which 
the quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops 



68 



to morula or blastocyte and then transferred to pseudopregnant female foster animal. The 
offspring borne of this female foster animal will be a clone of the animal from which the 
cell, e.g., the somatic cell, is isolated. 
Examples 

Example 1: Identification ofNRHKl Sequence in Human Genome Database 
[0262] The nucleic acid sequence ofNRHKl is obtained from a newly developed 
genomic prediction pipeline. Briefly, the X-ray crystal structures of the catalytic domains 
of protein kinases were collected and aligned together according to their structural 
identity/similarities. The alignment was converted into a "scoring matrix" which carried the 
structural profile of the kinase catalytic domains. This scoring matrix was then used to 
search the Celera Human Genome database for sequences that have kinase catalytic 
domains. 

Example 2: BLAST Analysis 

[0263] Sequence alignments between NRHK1 and other sequences in GenBank 
database were performed using the standard protein-protein BLAST(blastp), standard 
nucleotide-nucleotide BLAST(blastn), BLAST2 Sequences, and human genome BLAST 
programs that are available at NCBI's BLAST website. 

[0264] A standard protein-protein BLAST search in the "nr" database (available at 
NCBI's BLAST website) with "Filter" setting unchecked, "Expect" setting at 10.0, "Word 
Size" setting at 3, "Matrix" setting at BLOSUM62, "Gap costs" setting at Existence^ and 
Extension:!, identified partial amino acid sequence similarities between NRHK1 and a 
number of proteins. These proteins include, but are not limited to, Populus x canescen 
NIMA-related protein kinase (Entrez accession number: AF469649, 28% alignment to 
amino acid residues 25 - 285 ofNRHKl), L. esculentum LSTK-l-like kinase (Entrez 
accession number: AF079103, 27% alignment to amino acid residues 25 - 321 ofNRHKl), 
C. elegans hypothetical protein T07A9.3 (Entrez accession number: AF036706, 25% 
alignment to amino acid residues 27 - 307 ofNRHKl), and human NIMA-related kinase 1 
(Entrez accession number: XM_048605, 24% alignment to amino acid residues 25 - 297 of 
NRHK1). 

[0265] A conserved domain search was performed within the standard protein-protein 
BLAST search with the RPS-BLAST 2.2.3 [Apr-24-2002] program. The amino acid 
residues 28 - 297 ofNRHKl share high homologies to the consensus sequences of the 
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catalytic domain of ser/thr protein kinase (Entrez accession number: smart00220, 
100.0%alignment), the kinase domain of pkinase (Entrez accession number: pfam00069, 
100.0% alignment), and the catalytic domain of tyrosine kinase (Entrez accession number: 
smart00219, 87.5% alignment). 

[0266] A standard nucleotide-nucleotide BLAST search in database nr (available at 
NCBI's BLAST website) with "Filter" setting unchecked, "Expect" setting at 10.0, "Word 
Size" setting at 3, identified only one nucleotide sequence: a putative human cDNA 
LOC169436 (XM_095696, SEQ ID NO:4) that showed significant homology to nucleotides 
88 - 174 (100% identities), 174 - 583 (100% identities), and 575 - 2403 of NRHK1 (99% 
identities). 

[0267] A standard nucleotide-nucleotide BLAST search in the "pat" database (available 
at NCBI's BLAST website) with "Filter" setting unchecked, "Expect" setting at 10.0, "Word 
Size" setting at 3, identified significant nucleotide sequence similarities between NRHK1 
with a human protein kinase-like protein SGK071 (Entrez accession number: AX056458, 
SEQ ID NOS:5 and 6), which was disclosed in PCT patent application WO00/73469. 
Further analysis using pairwise BLAST algorithm revealed that NRHK1 and SGK071 share 
84% sequence identities at the amino acid level (blastp, matrix: BLOSUM62, gap open: 1 1 , 
Gap extension: 1, x_dropoff: 50, expect: 10.0, wordsize: 3, filter: unchecked), and 90% 
sequence identities at nucleotide level (blastn, match: 1 , mismatch: -2, gap open: 5, gap 
extension: 0, x_dropoff: 50, expect: 10.0, wordsize: 1 1 , filter: unchecked). 
[0268] A human genome search was carried out using blastn program with Expect 
setting at 0.01, Filter setting at default, Descriptions setting at 100, and Alignment settings 
at 100. The NRHK1 gene was mapped to locus 9q34 of human chromosome 9. 
Specifically, NRHK1 gene is located between genes LOC157890 and LOC57109, and 
overlaps with gene LOC169436 on chromosome 9. All the twenty-one exons of the 
NRHK1 gene were mapped to nucleotides 110001 to 137201 in human chromosome 9 of 
the Entrez Human Genome Sequence Database maintained by NCBI. The exons were also 
mapped to Celera genomic database (SEQ ID NO:3). The exons/introns in the NRHK1 
gene were determined using the program "sim4" described by Florea et al. in "A computer 
program for aligning a cDNA sequence with a genomic DNA sequence." Genome Res. 
8:967-974, 1998. 



70 



Example 3: Hydrophobicity Analysis 

10269] The hydrophobicity profile of NRHK1 sequence (FIGURE 5) was generated 
using the GES (Goldman, Engelman and Steitz) hydrophobicity scale (Engelman, D. M, 
Steitz, T.A. and Goldman, A. 1986. Identifying nonpolar transmembrane helices in amino 
acid sequences of membrane proteins. Ann. Rev. Biophys. Biophys.Chem. 15:321-353, 
1986). Briefly, the GES scale is used to identify nonpolar transbilayer helices. The curve is 
the average of a residue-specific hydrophobicity scale over a window of 20 residues. When 
the line is in the upper half of the frame (positive), it indicates a hydrophobic region and 
when it is in the lower half (negative), a hydrophilic region. 

[0270] In FIGURE 5, the X-axis represents the length of the protein in amino acids 
(aa), while the Y-axis represents the GES score. The curve line shows the GES pattern of 
the entire protein, while the strait line represents certain cutoff for potential membrane 
spanning domains. The hydrophobicity profile indicates that NRHK1 is probably not a 
membrane protein. 

[0271] Having described the preferred embodiments of compositions, organisms and 
methodologies employing a novel human gene NRHK1 (which are intended to be 
illustrative and not limiting), it is noted that modifications and variations can be made by 
persons skilled in the art in light of the above teachings. Therefore, it is understood that 
changes may be made in the particular embodiments disclosed which are within the scope 
of what is described as defined by the appended claims. 
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